• Building Databases and Pipelines: Developing databases, data lakes, and data ingestion pipelines to deliver datasets for various projects
• End-to-End Solutions: Designing, developing, and deploying comprehensive solutions for data and data science models, ensuring usability for both data scientists and non-technical users. This includes following best engineering and data science practices
• Scalable Solutions: Developing and maintaining scalable data and machine learning solutions throughout the data lifecycle, supporting the code and infrastructure for databases, data pipelines, metadata, and code management
• Stakeholder Engagement: Collaborating with stakeholders across various departments, including data platforms, architecture, development, and operational teams, as well as addressing data security, privacy, and third-party coordination
• Experience in Data Engineering, SQL, ETL(data validation + data mapping + exception handling) 4+ years
• Hands-on experience with Databricks 2+ years
• Experience with Python
• Experience with Power BI is nice to have
• Experience with AWS (e.g. S3, Redshift, Athena, Glue, Lambda, etc.)
• Knowledge of the Energy industry (e.g. energy trading, utilities, power systems etc.) would be a plus
• Experience with Geospatial data would be a plus
• At least an Upper-Intermediate level of English