The role involves building and maintaining large-scale data platforms and pipelines to support AI and analytics initiatives, enabling data-driven decision-making and generative AI applications within a fast-growing, remote-friendly environment.
Key Responsibilities
Build and maintain ETL/ELT data pipelines in Databricks and Spark for analytics and AI use cases
Develop and evolve data models to support reporting, experimentation, and advanced workflows
Implement monitoring, alerts, and testing to ensure data quality, timeliness, and lineage
Support workflow orchestration with Databricks Jobs, DBT, or similar tools at scale
Design and optimize bulk data pipelines for generative AI applications in Databricks
Partner with AI engineers and data scientists to enable experimentation, model training, and deployment
Develop frameworks for data ingestion, transformation, governance, and monitoring across various systems
Requirements
2-3 years of industry experience in data engineering, with significant experience building large-scale data platforms.
Hands-on experience working with modern data technologies stack, such as Databricks, DBT, Redshift, RDS, Snowflake or similar solutions.
Proficiency in Python and SQL, with experience in designing robust ETL ELT pipelines.
Experience orchestrating data workflows at scale and enabling machine learning or AI use cases.
Strong understanding of data modeling, performance optimization, and cost-efficient infrastructure design.
Located in and authorized to work in the United States; this is a fully remote role.
Experience enabling generative AI workflows in Databricks or similar platforms.
Familiarity with vector databases, embeddings, and retrieval systems.
Experience with Salesforce, Gainsight, Gong, Outreach, or other CRM enablement tools as data sources.
Proven ability to automate repetitive tasks, improve data hygiene, and enable experimentation across GTM data use cases.
Exposure to observability, monitoring, and governance best practices for data and AI systems.
Ability to collaborate closely with AI/ML teams while driving technical excellence in data engineering.
Benefits & Perks
Annual Base Salary ranging from 101,745 USD to 153,900 USD
Fully remote work opportunity (excluding certain metro areas)
Flexible working model supporting in-person, hybrid, or remote work
Initial RSU grant with no vesting cliff and ongoing refresh opportunities
Performance-based bonus variable pay
Equity for eligible roles
Comprehensive health and parental leave plans
Professional development stipend
Ready to Apply?
Join Samsara and make an impact in renewable energy