Demo

Data Engineer

The AI Institute
Cambridge, MA Full Time
POSTED ON 1/29/2025
AVAILABLE BEFORE 3/28/2025

Our Mission

Our mission is to solve the most important and fundamental challenges in AI and Robotics to enable future generations of intelligent machines that will help us all live better lives.


Data Engineers will work cross-functionally, creating new technology to support software development for robots. If you have a passion for developing data collection and processing infrastructure for robots and robotic learning, you will want to join us! We are onsite in our new Cambridge, MA office where we are building a collaborative and exciting new organization.

\n


Responsibilities
  • Work collaboratively with research scientists and software engineers on software development for a range of different robotic platforms.
  • Develop and maintain our data storage solutions and data pipelines in cloud and on-premise infrastructure.
  • Use Python and Terraform to develop and scale cloud-native data stores.
  • Build event- and batch-driven ingestion systems for machine learning and R&D.
  • Write and maintain user guides for internally developed tools.
  • Create and use systems to clean, integrate, or fuse datasets to produce data products.
  • Establish and monitor data integrity and quality through visualization, profiling, and statistical tools.
  • Perform updates, migrations, and administration tasks for data systems.
  • Develop and implement data governance and data retention strategies.


Requirements
  • BS/MS in computer science, robotics, or equivalent experience.
  • 6 years of experience in a data engineering, software engineering, DevOps, or MLOps role.
  • Strong experience building event-driven data ingestion systems.
  • Strong experience with distributed data/computing tools, such as Spark, Ray, EMR, Dataproc, Dask, or Pandas on Spark.
  • Strong experience with ETL design and implementations in the context of large, multimodal, distributed datasets.
  • Strong experience with workflow orchestration tools, such as Airflow, Argo Workflows, Cloud Composer, MWAA, Step Functions, or Prefect.
  • Demonstrated experience building containerized applications using tools and frameworks such as Docker, Docker-compose, Podman, or OCI.
  • Demonstrated experience with schema management and schema evolution.
  • Demonstrated experience with databases and data storage solutions, such as Google Cloud Storage (GCS), S3, BigQuery, NoSQL and/or SQL.
  • Experience with container orchestration tools, such as Kubernetes, GKE, EKS, or AKS.
  • Experience with UNIX/Linux including basic commands and shell scripting.


Bonus (Not Required)
  • Associate- or Professional-level GCP certifications.
  • 3 years of experience working on time-series data and streaming applications.
  • 3 years of experience with NoSQL implementation such as Mongo, Cassandra, DynamoDB, Datastore, or BigTable.
  • 3 years of experience working with on-prem compute and storage appliances.
  • 3 years of experience with data streaming tools, such as Kafka, Flink, Kinesis, Beam, Spark Streaming, or Dataflow.
  • 2 years of experience customizing package managers or build tools, such as Make, Poetry, or Bazel.
  • 2 years of experience with Infrastructure as Code tools such as Terraform, Go CDK, or AWS CDK.
  • 2 years of experience using data quality tools, such as great-expectations, or Cerberus.


\n

We provide equal employment opportunities to all employees and applicants for employment and prohibit discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state or local laws.

If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

What is the career path for a Data Engineer?

Sign up to receive alerts about other jobs on the Data Engineer career path by checking the boxes next to the positions that interest you.
Income Estimation: 
$92,929 - $122,443
Income Estimation: 
$122,257 - $154,284
Income Estimation: 
$114,981 - $143,201
Income Estimation: 
$129,640 - $165,363
Income Estimation: 
$92,929 - $122,443
Income Estimation: 
$122,257 - $154,284
Income Estimation: 
$122,257 - $154,284
Income Estimation: 
$143,391 - $179,890
Income Estimation: 
$67,172 - $106,823
Income Estimation: 
$87,954 - $124,905
Income Estimation: 
$54,658 - $80,222
Income Estimation: 
$85,711 - $119,978

Sign up to receive alerts about other jobs with skills like those required for the Data Engineer.

Click the checkbox next to the jobs that you are interested in.

  • Computer Simulation Skill

    • Income Estimation: $83,633 - $115,564
    • Income Estimation: $88,239 - $107,750
  • Cost Estimation Skill

    • Income Estimation: $80,855 - $109,590
    • Income Estimation: $78,752 - $113,368
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Job openings at The AI Institute

The AI Institute
Hired Organization Address Cambridge, MA Full Time
Our Mission Our mission is to solve the most important and fundamental challenges in AI and Robotics to enable future ge...
The AI Institute
Hired Organization Address Cambridge, MA Full Time
Our mission is to solve the most important and fundamental challenges in AI and Robotics, enabling future generations of...
The AI Institute
Hired Organization Address Cambridge, MA Full Time
Our mission is to solve the most important and fundamental challenges in AI and Robotics to enable future generations of...
The AI Institute
Hired Organization Address Cambridge, MA Full Time
Our Mission Our mission is to solve the most important and fundamental challenges in AI and Robotics to enable future ge...

Not the job you're looking for? Here are some other Data Engineer jobs in the Cambridge, MA area that may be a better fit.

Data Engineer

Catalytic Data Science, BOSTON, MA

Sr Data Engineer (Life and Annuity)

NTT DATA Group Corporation, Boston, MA

AI Assistant is available now!

Feel free to start your new journey!