Data Engineer Job at Imbue, San Francisco, CA

K1RjVWlTTk5TcW1DcDI3Vyt2eVFEVmpQR0E9PQ==
  • Imbue
  • San Francisco, CA

Job Description

Summary

We’re a small, cross-functional team focused on building AI systems that reason and code. We care deeply about understanding how people interact with these systems and how we can use data to make them safer, smarter, and more useful .

We're looking for a Data Engineer to build and own the pipelines and data infrastructure that power our product and research efforts. Your work will directly support model training, evaluation, product analytics, and safety systems. You’ll partner closely with team members building our coding agents to make sure we’re capturing the right signals and using them well.

If you’re excited about turning messy product data into actionable insights, and building systems that can scale with our research, we’d love to get connected!

Example Projects

• Combine synthetic data generation with human annotation platforms to produce high quality data that advances our product and research roadmap.

• Design and build resilient, scalable pipelines (ETL and ELT) for batch and streaming data.

• Develop and maintain infrastructure to support self-serve analytics, experimentation, and dataset generation. Prototype, evaluate, and make “build vs buy” decisions.

• Help define and improve data modeling practices across the company, including instrumentation standards, dimensional modeling for analytics and feature stores for machine learning (ML).

• Build integrations with ML infrastructure to support training pipelines, inference logging, and model monitoring (MLOps).

• Debug pipeline failures, automate deployment processes, and improve data quality and reusability.

You are

• A strong software engineer with 5+ years of experience, ideally working with large-scale data systems.

• Experienced in designing and maintaining data pipelines and infrastructure, especially for analytics, experimentation, and ML.

• Comfortable with tools for data orchestration (Airflow, Prefect), batch or streaming processing (Spark, Ray, Flink), and event tracking and analytics (Amplitude, PostHog).

• Experienced with cloud-based infrastructure and storage (e.g., S3, GCP, Snowflake, or Redshift), and thoughtful about cost-performance tradeoffs.

• Exposure to MLOps, model serving infrastructure, or ML workflows.

• Pragmatic and principled! You know when to optimize and when to ship.

Compensation and Benefits

• Work directly on creating software with human-like intelligence.

• Generous compensation, equity, and benefits.

• B udget for self-improvement: coaching, courses, conferences, etc.

• Actively co-create and participate in a positive, intentional team culture.

• Spend time learning, reading papers, and deeply understanding prior work.

• Frequent team events, dinners, off-sites, and hanging out.

• Compensation packages are highly variable based on a variety of factors. If your salary requirements fall outside of the stated range, we still encourage you to apply. The range for this role is $170,000–$350,000 cash, $10,000–$2,000,000 in equity.

How to apply

All submissions are reviewed by a person, so we encourage you to include notes on why you're interested in working with us. If you have any other work that you can showcase (open source code, side projects, etc.), certainly include it! We know that talent comes from many backgrounds, and we aim to build a team with diverse skillsets that spike strongly in different areas.

About us

Imbue builds AI systems that reason and code, enabling AI agents to accomplish larger goals and safely work in the real world. We train our own foundation models optimized for reasoning and prototype agents on top of these models. By using these agents extensively, we gain insights into improving both the capabilities of the underlying models and the interaction design for agents.

We aim to rekindle the dream of the *personal* computer, where computers become truly intelligent tools that empower us, giving us freedom, dignity, and agency to pursue the things we love.

Job Tags

Full time,

Similar Jobs

TransForce Inc.

CDL B Driver - $25/ hr + OT after 40 hrs Job at TransForce Inc.

 ...Weekly Pay: $1,000 - $1,500 Job Details ~ Start time 9:00 AM ~ Local Home Daily ~ Touch Freight, pallet jack, hand truck and hand ~ Box Truck ~1+ years driving exp. Benefits ~ Competitive weekly pay ~ Medical, dental and vision insurance ~ Life... 

DHL Supply Chain

CDL - Class A Local Shuttle Driver - No Touch Job at DHL Supply Chain

 ...CDL - Class A Local Shuttle Driver - No Touch Richmond, VA Pay: ~$$21.00 per hour plus $1.00 night shift differential per hour; ~ Average annual earnings: $45,000 Work / Life Balance Schedule: ~ Work schedule Mon- Fri 7:00 am - 3:30 Pm & 3:30 PM - 12... 

Lingatech

.NET Full Stack Developers Future Opportunities (Pipeline Posting) Job at Lingatech

 ...Status: Not an Active Opening** Job Summary LingaTech is proactively building a pipeline of experienced .NET Full Stack Developers in anticipation of future openings with our Commonwealth and commercial clients in Central Pennsylvania. This posting is not associated... 

ESSpa Kozmetika Organic Hungarian Day Spa and Skincare Salon

Spa Therapist Job at ESSpa Kozmetika Organic Hungarian Day Spa and Skincare Salon

 ...Job Description Company Description ESSpa Kozmetika Organic Hungarian Day Spa and Skincare Salon, founded in 2002 by Hungarian Master Esthetician Eva Sztupka-Kerschbaumer, is dedicated to delivering a personalized and holistic approach to skincare. Located in Pennsylvania... 

TradeJobsWorkforce

American Airlines Reservation Agent (Remote) Job at TradeJobsWorkforce

 ...Now hiring an experienced American Airlines Reservation Agent (Remote) to book flights, modify reservations, and assist travelers remotely....  ...include competitive pay, flexible scheduling, training opportunities, a supportive work environment, and career growth potential....