What are the responsibilities and job description for the Data Engineer (Remote in North Carolina) position at Rocket Lawyer?
About Rocket Lawyer
We believe everyone deserves access to affordable and simple legal services. Founded in 2008, Rocket Lawyer is the largest and most widely used online legal service platform in the world. With offices in North America, South America, and Europe, Rocket Lawyer has helped over 30 million people create over 50 million legal documents and get their legal questions answered.
We are in a unique position to enhance and expand the Rocket Lawyer platform to a scale never seen before in the company’s history, to capture audiences worldwide. We are expanding our team to take on this challenge!
About The Role
We are seeking a highly skilled and passionate Data Engineer to join our growing team focused on building and deploying cutting-edge AI/ML solutions. As a Data Engineer, you will play a crucial role in designing, building, and maintaining the data infrastructure powering the AI models for Rocket Copilot, our AI legal assistant. You will work closely with Machine Learning Engineers, Data Scientists, and Product Managers to ensure the availability of high-quality data for training, fine-tuning, and evaluating generative models. This role requires a strong understanding of data engineering principles, experience with large-scale data processing, and a passion for pushing the boundaries of AI.
We value a fun, collaborative, team-oriented work environment, where we celebrate our accomplishments.
Responsibilities
- Design, develop, and maintain robust, scalable, and efficient data pipelines for ingesting, processing, transforming, and storing large datasets used for training and evaluating generative AI models.
- Perform data cleaning, normalization, transformation, and feature engineering to prepare data for model training. This includes handling unstructured data like text, images, and audio.
- Build and manage the data infrastructure, including data lakes, data warehouses, and databases, optimized for AI workloads.
- Implement data quality checks and monitoring systems to ensure data accuracy, completeness, and consistency.
- Contribute to the development and implementation of MLOps best practices for data management and model deployment.
- Work with GCP and Snowflake and their data and AI offerings.
- Optimize data pipelines and infrastructure for performance, scalability, and cost-effectiveness.
Qualifications
- 5 years of Python experience.
- 3 years of experience with technologies such as Airflow and Apache Spark.
- Experience working with large language models (LLMs), diffusion models, or other generative models.
- Experience with MLOps tools and practices.
- Strong understanding of data architectures and patterns.
- Experience with containerization technologies (e.g., Docker, Kubernetes).
- Contributions to open-source projects.
- Experience in DataOps implementation and support.
- Experience in MLOps implementation and support.
- Experience building and supporting AI/ML platforms.
Benefits
- Comprehensive health plans (including Medical, Dental and Vision insurance for full-time employees)
- Unlimited PTO
- Competitive salary packages
- Life insurance
- Disability benefits
- Supplemental Optional Life Insurance Benefits
- Optional FSA options
- HSA with Company Match
- 401k program with Company Match
- Fertility Assistance and Planning options
- Wellhub & ClassPass fitness platforms
- Comprehensive Pet Insurance options
- Financial Wellbeing & Student Loan Program access
- Access to additional Mental Health & Wellbeing resources
- Pre-tax Commuter/Transit Benefits
- Free Rocket Lawyer account with online access to an extensive legal documents library and brilliant licensed attorneys at discounted rates
All your information will be kept confidential according to EEO guidelines.
You may request reasonable accommodations by sending an email to hr@rocketlawyer.com.
Compensation
Base salary range by location:
- San Francisco Bay Area, CA: $124,000 - $160,000
- California (outside of San Francisco Bay Area) and Colorado: $106,000 - $139,000
- Utah, Arizona, and North Carolina: $99,000 - $131,000
$100,000 - $160,000 USD
By applying for this position, your data will be processed as per Rocket Lawyer Privacy Policy.