Demo

Code Data Validation Consultant (Machine Learning & Data Processing)

U.S. Tech Solutions Inc.
San Jose, CA Full Time
POSTED ON 4/25/2025
AVAILABLE BEFORE 6/25/2025

Job Details

Job Description:

  • Join our team to enable cutting-edge AI/ML innovation by building robust data pipelines and automation tools.
  • You ll work closely with human data operators and generative AI teams to process, analyze, and optimize high-quality datasets for training machine learning models.
  • Your work will directly impact the efficiency and performance of AI systems, from automating data quality checks to designing infrastructure that scales with evolving model requirements.
  • This role is ideal for a problem-solver who thrives in fast-paced environments and enjoys bridging data engineering with machine learning.

Responsibilities:

Data Pipeline Development:

  • Design and implement Python-based automation tools to process, clean, and transform raw data for ML training.
  • Build custom scripts to streamline data ingestion and preprocessing workflows.

Quality Analysis & Reporting:

  • Conduct manual and automated quality assessments to identify high/low-impact data for model training.
  • Generate reports detailing experimental results, data effectiveness, and recommendations for improvement.

ML Model Integration:

  • Train and evaluate open-source ML models (e.g., Gemma) to assess data impact on model performance.
  • Collaborate with AI teams to refine data selection strategies based on model feedback.

Infrastructure Optimization:

  • Develop scalable solutions in Colab/Jupyter Notebooks to automate data validation and filtering.
  • Troubleshoot and debug data formatting issues (e.g., code-comment relevance, dataset consistency).

Required (Mandatory):

  • Preferred: 2-3 years in data analysis/validation/engineering, ML engineering, or automation-focused roles.
  • Bonus: PhD graduates with hands-on ML/data processing projects.

Required (Desired):

  • Exposure to Generative AI models (e.g., GPT, Llama) or large-scale datasets.
  • Bash/Shell Scripting: Ability to automate repetitive tasks.
  • Familiarity with APIs for data ingestion/processing.
  • Experience contributing to open-source projects or public GitHub repositories.
  • Knowledge of cloud services.

Skills:

  • Technical Expertise:
  • Python: Medium to Advanced proficiency (scripting, automation, data processing libraries like Pandas/NumPy).
  • Hands-on experience writing, executing and reviewing code. (Preferably using Colab/Jupyter Notebooks)
  • Data & ML Skills:
  • Experience training/fine-tuning ML models and analyzing their performance.
  • Familiarity with public data platforms (Hugging Face, GitHub) and data formats (JSON, CSV).
  • Analytical Skills.
  • Proven ability to assess data quality and build tools to automate quality checks.

Why Join This Project:

  • Impact AI innovation by shaping the data backbone of advanced ML systems.
  • Collaborate with senior data engineers and generative AI experts.
  • Flexible hybrid work environment with opportunities for growth.

Education:

  • Bachelor s degree in Computer Science, Data Science, Engineering, or related STEM field.

About US Tech Solutions:

US Tech Solutions is a global staff augmentation firm providing a wide range of talent on-demand and total workforce solutions. To know more about US Tech Solutions, please visit .

US Tech Solutions is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or status as a protected veteran.

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

Salary : $55

If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

What is the career path for a Code Data Validation Consultant (Machine Learning & Data Processing)?

Sign up to receive alerts about other jobs on the Code Data Validation Consultant (Machine Learning & Data Processing) career path by checking the boxes next to the positions that interest you.
Income Estimation: 
$92,929 - $122,443
Income Estimation: 
$122,257 - $154,284
Income Estimation: 
$89,551 - $118,439
Income Estimation: 
$116,726 - $151,072
Income Estimation: 
$124,724 - $161,246
Income Estimation: 
$74,161 - $98,561
Income Estimation: 
$93,716 - $124,745
Income Estimation: 
$118,976 - $146,289
Income Estimation: 
$112,672 - $149,113
Income Estimation: 
$98,475 - $115,895
Income Estimation: 
$71,122 - $96,652
Income Estimation: 
$92,929 - $122,443
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Job openings at U.S. Tech Solutions Inc.

U.S. Tech Solutions Inc.
Hired Organization Address Atlanta, GA Full Time
Job Details Job Description: We are looking for a highly qualified SAP Technical Consultant to join our team in our Atla...
U.S. Tech Solutions Inc.
Hired Organization Address Englewood, CO Full Time
Job Details Job Description: We are looking for a talented and experienced full-time Senior Motion Designer to join our ...
U.S. Tech Solutions Inc.
Hired Organization Address Atlanta, GA Full Time
Job Details Job Description: The Data Center Technical Writing Team provides highly critical documentation that supports...
U.S. Tech Solutions Inc.
Hired Organization Address Cleveland, OH Full Time
Job Details Job Description: *Note: Candidate's must be within a 35-mile footprint of the following locations: Albany NY...

Not the job you're looking for? Here are some other Code Data Validation Consultant (Machine Learning & Data Processing) jobs in the San Jose, CA area that may be a better fit.

Data Engineer

Avenue Code, Pleasanton, CA

Tech Lead (Data)

Avenue Code, Mountain View, CA

AI Assistant is available now!

Feel free to start your new journey!