Data Engineer II

Samsara

Remote

Full Time

Posted April 28, 2026

$102k - $154k

Not Specified

Remote

~88 people viewed this recently

Apply Now

Application opens on company website

Job Description

The role involves building and maintaining large-scale data platforms and pipelines to support AI and analytics initiatives, enabling data-driven decision-making and generative AI applications within a fast-growing, remote-friendly environment.

Key Responsibilities

Build and maintain ETL/ELT data pipelines in Databricks and Spark for analytics and AI use cases
Develop and evolve data models to support reporting, experimentation, and advanced workflows
Implement monitoring, alerts, and testing to ensure data quality, timeliness, and lineage
Support workflow orchestration with Databricks Jobs, DBT, or similar tools at scale
Design and optimize bulk data pipelines for generative AI applications in Databricks
Partner with AI engineers and data scientists to enable experimentation, model training, and deployment
Develop frameworks for data ingestion, transformation, governance, and monitoring across various systems

Requirements

2-3 years of industry experience in data engineering, with significant experience building large-scale data platforms.
Hands-on experience working with modern data technologies stack, such as Databricks, DBT, Redshift, RDS, Snowflake or similar solutions.
Proficiency in Python and SQL, with experience in designing robust ETL ELT pipelines.
Experience orchestrating data workflows at scale and enabling machine learning or AI use cases.
Strong understanding of data modeling, performance optimization, and cost-efficient infrastructure design.
Located in and authorized to work in the United States; this is a fully remote role.
Experience enabling generative AI workflows in Databricks or similar platforms.
Familiarity with vector databases, embeddings, and retrieval systems.
Experience with Salesforce, Gainsight, Gong, Outreach, or other CRM enablement tools as data sources.
Proven ability to automate repetitive tasks, improve data hygiene, and enable experimentation across GTM data use cases.
Exposure to observability, monitoring, and governance best practices for data and AI systems.
Ability to collaborate closely with AI/ML teams while driving technical excellence in data engineering.