Senior Data Engineer

São Paulo Remote

We are seeking a highly skilled and motivated Senior Data Engineer to play a key role in our significant data platform migration project. You will be responsible for designing, developing, and optimizing data pipelines and transformations as we transition from our current Airflow, PySpark, Athena, and BigQuery environment to a modern stack built on Airflow, dbt, and Snowflake. This role requires a strong understanding of data warehousing principles, excellent SQL and Python skills, and a proven ability to deliver robust and scalable data solutions.

 

Your Responsibilities


● Data Migration & Pipeline Development:
Design, develop, and implement efficient and reliable data pipelines to migrate data from PySpark/Athena/BigQuery to DBT/Snowflake.
Translate complex data requirements into actionable dbt models and transformations within Snowflake.
Build and maintain Airflow DAGs for orchestrating data ingestion, transformation, and loading processes.
Optimize existing data pipelines for performance, scalability, and cost efficiency in the new Snowflake environment.
● Data Modeling & Transformation:
Develop and maintain robust data models in Snowflake using dbt, adhering to best practices for data warehousing and analytics.
Write complex SQL queries for data extraction, transformation, and loading.
Ensure data quality, accuracy, and consistency throughout the migration and ongoing data operations.
● Troubleshooting & Optimization:
Identify, diagnose, and resolve data-related issues, performance bottlenecks, and data discrepancies.
Proactively monitor data pipelines and systems to ensure smooth operation and data availability.
Implement performance tuning strategies within Snowflake and dbt to optimize query execution and resource utilization.
● Collaboration & Documentation:
Collaborate closely with Lead Data Engineers, Data Analysts, and other stakeholders to understand data needs and deliver effective solutions.
Contribute to the development and maintenance of comprehensive technical documentation for data pipelines, models, and processes.
Participate in code reviews and contribute to the team's adherence to coding standards and best practices.

 

Requirements

Qualifications


● Bachelor's or Master's degree in Computer Science, Engineering, or a related quantitative field.
● 3+ years of experience in data engineering, with a focus on building and maintaining scalable data pipelines.
● Solid experience with data migration projects and working with large datasets.
● Strong hands-on experience with Snowflake, including data loading, querying, and performance optimization.
● Proficiency in dbt (data build tool) for data transformation and modeling.
● Proven experience with Apache Airflow for scheduling and orchestrating data workflows.
● Expert-level SQL skills, including complex joins, window functions, and performance tuning.
● Proficiency in Python for data manipulation, scripting, and automation for edge cases
● Familiarity with PySpark, AWS Athena, and Google BigQuery (source systems).
● Understanding of data warehousing concepts, dimensional modeling, and ELT principles.
● Knowledge of building CI/CD pipelines for code deployment
● Experience with version control systems (e.g., Github).
● Excellent problem-solving, analytical, and communication skills.
● Ability to work independently and as part of a collaborative team in an agile environment.
● Must speak and write in English fluently; Effective communicator