Job Type:Full Time

Big Data engineer/developer (Must have design Real-time solutions using Apache Spark Streaming).

About Us:

Infosys is a global leader in consulting, technology, outsourcing and next-generation services. We enable clients, in more than 50 countries, to stay a step ahead of emerging business trends and outperform the competition. We help them transform and thrive in a changing world by co-creating breakthrough solutions that combine strategic insights and execution excellence.

Visit to see how Infosys (NYSE: INFY), with US$ 8.7 billion in annual revenues and 190,000+ employees, is helping enterprises renew themselves while also creating new avenues to generate value.

Short Description: You will be engaging with key stakeholders and providing knowledge, experience and thought leadership in big data development and engineering solution implementation.

Roles and Responsibilities:

Candidate Experience:

 Hadoop data engineer/developer should have minimum of 4-8 years of total IT experience with at least 2 big data project development and production deployment experience in data lake environment. Must have Java, Shell script programming experience.

 Candidate with any big data developer certification will be an added advantage.

Candidate will be responsible for:

 Quick POC implementation for technical feasibility

 Generic Framework component development

 Application development and unit testing

 Design documentation for the sprint tasks

 Run book and deployment plan preparation for sprint task

 Production deployment and handover to operational team

Candidate should have:

 Good understanding of HDP/HDF Hadoop components

 Good experience in handling data loading from disparate data sources

 Good experience in handling different data formats

 Good experience in handling different data load patterns in Hadoop such as – full data, incremental data and delta data handling

 Good experience in data acquisition using HDP native tools like Sqoop, Flume, Hadoop copy etc or using any custom built framework using any other Hadoop native tools

 Good experience in writing complex map reduce and spark programs for data transformations

 Good experience in writing complex Hive hqls, and custom hive udfs

 Good experience in writing complex Oozie workflows, coordinators and bundles

 Good experience in Shell script/Python

 Good experience in Java and REST web services

 Good knowledge of any enterprise scheduler like (control-m)

 Good knowledge on the performance tuning aspects in Hadoop components

Good to have:

- Hortonworks big data developer certification

- Experience in Hortonworks Data Flow or Apache Nif, FALCON data pipeline, ATLAS metadata management, PHOENIX, KAFKA, DROOLS, HBase, Solr

- Experience in streaming applications

Infosys is an equal opportunity employer and positively encourages applications from suitably qualified and eligible candidates regardless of gender or other attribute covered by equal opportunity legislation.

Please note in order to protect the interest of all parties involved in the recruitment process, Infosys does not accept any unsolicited resumes from third party vendors. In the absence of a signed agreement any submission will be deemed as non-binding and Infosys explicitly reserves the right to pursue and hire the submitted profile. All recruitment activity must be coordinated through the Talent Acquisition department.

EOE/Minority/Female/Veteran/Disabled/Sexual Orientation/Gender Identity