|Job Type:||Full Time|
Big Data engineer/developer (Must have design Real-time solutions using Apache Spark Streaming).
Infosys is a global leader in consulting, technology, outsourcing and next-generation services. We enable clients, in more than 50 countries, to stay a step ahead of emerging business trends and outperform the competition. We help them transform and thrive in a changing world by co-creating breakthrough solutions that combine strategic insights and execution excellence.
Visit www.infosys.com to see how Infosys (NYSE: INFY), with US$ 8.7 billion in annual revenues and 190,000+ employees, is helping enterprises renew themselves while also creating new avenues to generate value.
Short Description: You will be engaging with key stakeholders and providing knowledge, experience and thought leadership in big data development and engineering solution implementation.
Roles and Responsibilities:
Hadoop data engineer/developer should have minimum of 4-8 years of total IT experience with at least 2 big data project development and production deployment experience in data lake environment. Must have Java, Shell script programming experience.
Candidate with any big data developer certification will be an added advantage.
Candidate will be responsible for:
Quick POC implementation for technical feasibility
Generic Framework component development
Application development and unit testing
Design documentation for the sprint tasks
Run book and deployment plan preparation for sprint task
Production deployment and handover to operational team
Candidate should have:
Good understanding of HDP/HDF Hadoop components
Good experience in handling data loading from disparate data sources
Good experience in handling different data formats
Good experience in handling different data load patterns in Hadoop such as – full data, incremental data and delta data handling
Good experience in data acquisition using HDP native tools like Sqoop, Flume, Hadoop copy etc or using any custom built framework using any other Hadoop native tools
Good experience in writing complex map reduce and spark programs for data transformations
Good experience in writing complex Hive hqls, and custom hive udfs
Good experience in writing complex Oozie workflows, coordinators and bundles
Good experience in Shell script/Python
Good experience in Java and REST web services
Good knowledge of any enterprise scheduler like (control-m)
Good knowledge on the performance tuning aspects in Hadoop components
Good to have:
- Hortonworks big data developer certification
- Experience in Hortonworks Data Flow or Apache Nif, FALCON data pipeline, ATLAS metadata management, PHOENIX, KAFKA, DROOLS, HBase, Solr
- Experience in streaming applications
Infosys is an equal opportunity employer and positively encourages applications from suitably qualified and eligible candidates regardless of gender or other attribute covered by equal opportunity legislation.
Please note in order to protect the interest of all parties involved in the recruitment process, Infosys does not accept any unsolicited resumes from third party vendors. In the absence of a signed agreement any submission will be deemed as non-binding and Infosys explicitly reserves the right to pursue and hire the submitted profile. All recruitment activity must be coordinated through the Talent Acquisition department.
EOE/Minority/Female/Veteran/Disabled/Sexual Orientation/Gender Identity