Data Engineer, Python, AWS, Data Pipelines

Remote · USA Full-time New today

Job Description:

Design and maintain scalable ETL/ELT pipelines in Python, integrating structured and semi-structured data (JSON, CSV, XML) across Snowflake, MongoDB, Postgres, and AWS services (S3, Glue, Lambda, EC2, EMR, Redshift, RDS).
Build and optimize transformation and orchestration workflows using DBT, Airflow, Prefect, or Dagster.
Implement data governance, quality checks, and security best practices throughout data pipelines.
Extend and optimize the Elastic Hierarchy framework to harmonize financial data from various systems.
Collaborate with analysts, ML engineers, and product teams to deliver business-ready datasets and data solutions.

Requirements:

Bachelor’s or Master’s degree in Computer Science, Data Engineering, or a related field.
Strong Python programming skills with experience in data processing and automation.
Proven experience with the AWS data ecosystem, including S3, Glue, Lambda, EMR, EC2, Redshift, and RDS.
Hands-on experience with Snowflake, MongoDB, and Postgres databases.
Proficiency with DBT or similar data transformation tools, and with orchestration frameworks such as Airflow, Prefect, or Dagster.
Knowledge of data mapping, attribution, or reconciliation (experience in financial services is a strong plus).
Understanding of hybrid/on-premise deployment models within enterprise environments.
Excellent English communication skills and ability to collaborate effectively within distributed teams.

Benefits: Apply tot his job Apply To this Job

Related roles