• Assist in the design and development of ETL/ELT workflows using Databricks to move and transform data efficiently.
• Maintain and enhance existing Python and SQL codebases, ensuring data integrity and pipeline reliability.
• Debug and resolve defects in data processing pipelines under the guidance of senior team members.
• Clearly document code changes, data models, and technical processes to ensure team transparency.
• Work closely with other data engineers and product teams to translate basic business needs into technical solutions.
• Familiarity with PySpark or other distributed computing frameworks.
• Familiarity with version control systems like Gitlab and project management tools like Jira.