AWS Data Pipeline Development needs AI Software Development

Contact person: AWS Data Pipeline Development

Location: India

Budget: Recommended by industry experts

Time to start: As soon as possible

Project description:
"I need a production-ready data pipeline that can ingest roughly 5–7 GB of new transactions every day, land them in Amazon S3, transform them with AWS Glue, and store the refined output as partitioned Parquet files ready for analytics.

The pipeline must pull data from three independent sources—multiple MySQL databases, external APIs, and an SFTP drop—then orchestrate every step with Apache Airflow. Resilient scheduling, back-fill capability, and on-failure alerting are essential so that business dashboards in Power BI always have reliable, up-to-date information.

Key expectations
• End-to-end Airflow DAGs (Python) that manage extraction, loading to S3, Glue transformations, and catalog updates
• Glue ETL code that applies basic cleansing, schema validation, and Parquet conversion while keeping data quality top of mind
• Secure, scalable AWS setup (IAM roles, encryption at rest/in transit, sensible partitioning) that others on my team can extend
• Clear, step-by-step documentation covering deployment, scheduling, and recovery procedures

Acceptance criteria
— Data from all three sources is landed daily in S3 and appears in the Glue Data Catalog as Parquet partitions without manual intervention.
— DAG success rate stays above 99 % over a seven-day test window, with automatic retries and notifications.
— A sample Athena query on the final Parquet set returns record counts matching source systems within agreed tolerances.

If you have a proven track record with S3, Glue, Airflow, and Parquet optimisation, let’s talk details and timelines." (client-provided description)
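
As a rough illustration of the "sensible partitioning" and back-fill requirements in the description above, the sketch below builds Hive-style S3 prefixes (key=value path segments, which Glue and Athena can read as partitions) for each source and day. The bucket name, dataset name, source labels, and the choice of source/year/month/day as partition keys are all assumptions made for the example; the description does not specify a layout.

```python
from datetime import date, timedelta

# Hypothetical labels for the three source types named in the description.
SOURCES = ("mysql", "api", "sftp")


def partition_prefix(bucket, dataset, source, run_date):
    """Return a Hive-style S3 prefix for one source and one day."""
    return (
        f"s3://{bucket}/{dataset}/"
        f"source={source}/"
        f"year={run_date:%Y}/month={run_date:%m}/day={run_date:%d}/"
    )


def backfill_dates(start, end):
    """Return every date from start to end inclusive, for re-running past days."""
    days = (end - start).days
    return [start + timedelta(days=offset) for offset in range(days + 1)]


def backfill_prefixes(bucket, dataset, start, end):
    """Return the prefixes a back-fill over [start, end] would write to."""
    return [
        partition_prefix(bucket, dataset, source, day)
        for day in backfill_dates(start, end)
        for source in SOURCES
    ]


if __name__ == "__main__":
    # Assumed bucket and dataset names, for illustration only.
    for prefix in backfill_prefixes(
        "example-bucket", "transactions", date(2024, 1, 30), date(2024, 2, 1)
    ):
        print(prefix)
```

Keeping the run date in the path means a back-fill for a past day rewrites only that day's prefixes, so a re-run does not touch other partitions.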

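The acceptance criteria in the description (record counts matching within agreed tolerances, and a DAG success rate above 99% over a seven-day window) can be expressed as simple checks. The sketch below is one possible way to do that in plain Python. The 0.1% default tolerance, the function names, and the "success" state label are assumptions for illustration; the counts would in practice come from the source systems and from an Athena query, which the example does not perform.

```python
def within_tolerance(source_count, target_count, tolerance=0.001):
    """True if the target count deviates from the source by at most `tolerance`."""
    if source_count == 0:
        # Avoid division by zero: an empty source must yield an empty target.
        return target_count == 0
    return abs(source_count - target_count) / source_count <= tolerance


def reconcile(source_counts, target_counts, tolerance=0.001):
    """Check every source; a source missing from the target counts as zero rows."""
    return {
        name: within_tolerance(count, target_counts.get(name, 0), tolerance)
        for name, count in source_counts.items()
    }


def success_rate(run_states):
    """Fraction of runs whose state equals 'success' (assumed state label)."""
    if not run_states:
        return 0.0
    return sum(state == "success" for state in run_states) / len(run_states)


if __name__ == "__main__":
    # Made-up counts, for illustration only.
    source = {"mysql": 100000, "api": 2500, "sftp": 400}
    target = {"mysql": 99950, "api": 2500, "sftp": 390}
    print(reconcile(source, target))
    print(success_rate(["success"] * 6 + ["failed"]) > 0.99)
```

A check like this would typically run as the last task of the daily DAG, so that a failed reconciliation triggers the same alerting as any other task failure.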

Matched companies (7)

El Codamics

El Codamics is a Coimbatore-based software development firm helping startups, enterprises, and global clie…

Junkies Coder

Junkies Coder is a leading technology solution provider operating across 15 countries. Our strength is a team of 50+ rockstar developers. We specialize in web de…

HJP Media

I am the founder and CEO of HJP Media, the fastest-growing AI digital solutions company in the world, offering innovative, AI-powered digital marketing a…

April Innovations

April Innovations is one of the leading enterprise software development companies in Mumbai, serving clients in the USA, UK, and India. T…

Haven Futures

We build any kind of software and provide a wide range of tech solutions.

JanakiBhuvi Tech Labs Private Limited

Delivering Future-Ready Digital Solutions in Web Development, E-commerce, Logo Design, and Digital Marketing. We believe innovation is key to navigat…

Knowforth Tech

Empowering Businesses with Tailored Software & AI Solutions.