Demo

Data Engineer

Emonics LLC
Jersey, NJ Full Time
POSTED ON 1/25/2025
AVAILABLE BEFORE 4/22/2025

We at Procal are looking for a savvy Machine Learning & Data Engineer to join our team of analytics

experts to help us extract value from our data. You will lead all the processes from data collection,

cleaning, and preprocessing, to training models and deploying them to production. On a high level

we are looking for very hands-on engineers with good experience on big data, data architecture,

machine learning, and LLM.

The ideal candidate will be passionate about artificial intelligence and stay up to date with the

latest developments in the field.

This position will be a combination of typical Data Scientist math and analytical skills, with

research, advanced business, communication, and presentation skills.

Key Responsibilities

  • Develop big data scalable solutions using Hadoop, Hive, Spark, Map-Reduce, Java, Python.
  • Design schema and data molding for NoSQL Database & Data Warehouse.
  • Develop ETL data flow and Cloud Integration to build reporting solutions.
  • Assemble large, complex data sets that meet functional / non-functional requirements.
  • Identify, design, and implement internal process improvements : automating manual

processes, optimizing data delivery, re-designing infrastructure for greater scalability, etc.

  • Build the infrastructure required for optimal extraction, transformation, and loading of data
  • from a wide variety of data sources using SQL and Spark 'big data' technologies.

  • Designs, develops, codes, and troubleshoots with consideration of upstream and
  • downstream systems and technical implications.

  • Applies knowledge of tools within the Software Development Life Cycle toolchain to improve
  • the value realized by automation.

  • Applies technical troubleshooting to break down solutions and solve technical problems of
  • basic complexity.

  • Gathers, analyzes, and draws conclusions from large, diverse data sets to identify problems
  • and contribute to decision-making in service of secure, stable application development.

  • Verifying data quality, and / or ensuring it via data cleaning.
  • Exploring and visualizing data to gain an understanding of it, then identifying differences in
  • data distribution that could affect performance when deploying the model in the real world.

  • Understanding business objectives and developing models that help to achieve them, along
  • with metrics to track their progress.

  • Managing available resources such as hardware, data, and personnel so that deadlines are
  • met.

  • Designing, developing, and researching Machine Learning systems, models, and schemes
  • Studying, transforming, and converting data science prototypes
  • Performing statistical analysis and using results to improve models.
  • Training and retraining Client systems and models as needed.
  • Analyzing the use cases of Client algorithms and ranking them by their success probability
  • Understanding when your findings can be applied to business decisions.
  • Enriching existing Client frameworks and libraries.
  • Build efficient pipeline to host LLM service in local machine.
  • Develop high scalable RAG system combining with LLM to serve daily analysis and
  • troubleshooting.

    Key Skill sets

  • Good Communication and presentation skills
  • Team player
  • Experience in R and / or Python required.
  • Proficiency with a deep learning framework such as TensorFlow or Keras.
  • Proficiency with Python and basic libraries for machine learning such as scikit-learn and
  • pandas.

  • Expertise in visualizing and manipulating big datasets.
  • Good understanding of AI / Client stack - GPUs, MLFlow, LLM models
  • Hands-on practical experience in Java, Scala and / or Python, system design, application
  • development, testing, and operational stability

  • Experience in developing, debugging, and maintaining code in a large corporate environment
  • with one or more modern programming languages and database querying languages

  • Experience across the whole Software Development Life Cycle
  • Exposure to agile methodologies such as CI / CD, Applicant Resiliency, and Security
  • Emerging knowledge of software applications and technical processes within a technical
  • discipline (e.g., cloud, artificial intelligence, machine learning, mobile, etc.

  • Knowledge of Unix shell and SQL as well as NoSQL DBs is required.
  • Experience with Linux, Spark, and Kafka.
  • Good understanding of Large Language Model from system engineering perspective.
  • Qualifications

  • MS or PhD in a relevant field (Computer Science, Engineering, Statistics, Physics, Applied
  • Math)

  • 5 years of experience with Python to analyze datasets, train , evaluate, deploy, and optimize
  • models.

  • 3 Experience with Client frameworks such as PyTorch, TensorFlow, or similar
  • 3 years of machine learning / statistical modeling data analysis tools and techniques, and
  • parameters that affect their performance experience.

  • 1 year experience working with technologies related to large language models including LLM
  • architectures, model evaluation, adapters, model customization including pre-training and

    fine-tuning techniques.

  • Proficient with design, deployment, and evaluation of LLM-powered agents and tools and
  • orchestration approaches.

  • Proficient with prompt engineering, embedding model fine tuning and retrieval method
  • evaluation and optimization approaches.

  • Master's degree in a quantitative field such as statistics, mathematics, data science,
  • business analytics, economics, finance, engineering, or computer science

    If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
    Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

    What is the career path for a Data Engineer?

    Sign up to receive alerts about other jobs on the Data Engineer career path by checking the boxes next to the positions that interest you.
    Income Estimation: 
    $92,929 - $122,443
    Income Estimation: 
    $122,257 - $154,284
    Income Estimation: 
    $90,112 - $113,166
    Income Estimation: 
    $116,765 - $144,626
    Income Estimation: 
    $116,765 - $144,626
    Income Estimation: 
    $142,836 - $179,016
    Income Estimation: 
    $142,836 - $179,016
    Income Estimation: 
    $177,911 - $222,488
    Income Estimation: 
    $73,798 - $89,311
    Income Estimation: 
    $90,112 - $113,166
    View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

    Job openings at Emonics LLC

    Emonics LLC
    Hired Organization Address Windsor, CT Full Time
    Job Description : - Develop and implement machine learning models using Google Vertex AI & Python in an application of C...
    Emonics LLC
    Hired Organization Address Pasco, WA Full Time
    Must have Skills : Python for Data Science (Strong), Prompt Engineering, knowledge graph, Good To Have Skills : AWS, Job...
    Emonics LLC
    Hired Organization Address Piscataway, NJ Full Time
    Emonics is seeking to hire a skilled Network Engineer to join our team focusing on Operational Technology (OT) environme...
    Emonics LLC
    Hired Organization Address Atlanta, GA Full Time
    Short Description : Experienced web and API applications developer needed to redesign websites and build APIs, including...

    Not the job you're looking for? Here are some other Data Engineer jobs in the Jersey, NJ area that may be a better fit.

    Sr. Network Engineer

    Hudson Data, Jersey, NJ

    Sr .Net Engineer

    Hudson Data, Jersey, NJ

    AI Assistant is available now!

    Feel free to start your new journey!