What are the responsibilities and job description for the Senior Big Data Engineer (Urgent) position at Jobsbridge?
Company Description
Jobs Bridge Inc is one of the fastest-growing IT staffing and professional services organizations, with its own job portal. Jobs Bridge works closely with a large number of IT organizations on the most in-demand technology skill sets.
Job Description
• Build a Big Data text mining and natural language processing framework.
• Extract meaningful data from text and unstructured transcripts.
• Develop text mining machine learning algorithms and data science solutions.
• Build world-class, high-volume, real-time data ingestion frameworks and automate the ingestion of various data sources into Hadoop.
• Research, develop, optimize, and innovate on frameworks and related components for enterprise-scale data analysis and computation.
• Develop validation frameworks and proactive monitoring solutions to detect data ingestion failures in the big data platform and take appropriate remedial action.
• Develop data adapters to ingest large volumes of unstructured, semi-structured, and structured data from various data sources and types.
• Collaborate with people working on various technologies and ensure consistency of the data exposed through these different channels.
• Own the end-to-end development life cycle, deliver high-quality solutions and code, and evangelize test-driven development (tests, code coverage, etc.).
• Follow a customer-centric approach and ensure the solutions developed actually meet customer requirements.
• 8 years of demonstrable experience in requirements analysis, design, development, and testing of distributed, enterprise-class applications/platforms, with particular attention to scalability and high performance.
• Experience with NLP, Elasticsearch, text mining, Spark, Hive, Pig, and MapReduce.
• Strong object-oriented programming experience (Java/Python preferred).
• Experience with NoSQL databases: HBase, MongoDB.
• Knowledge of and experience with RDBMS, O-R mapping, and the application of distributed caching technologies.
Qualifications
NLP, Elasticsearch, Text Mining, Spark, Hive, Pig, MapReduce
Additional Information
Multiple openings for GC holders/citizens