Hadoop administration job in Mumbai for capgemini

Experience : 6-9 Years
Location : Mumbai
Mode : C2H (Contract based position)
Skill : Hadoop Admin
Account : Metlife

Job Description:
Metlife;

1. Responsible for implementation and ongoing administration of Hadoop
infrastructure.

2. Aligning with the engineering team to propose and deploy new hardware and
software environments required for Hadoop and to expand existing
environments.

3. Working with AD teams to setup and monitor Hadoop users. This job
includes setting adding approved Active Directory Groups and testing HDFS,
Hive, Pig and MapReduce access..

4. Cluster maintenance as well as creation and removal of nodes using tools
using Ambari.

5. Performance tuning of Hadoop clusters and Hadoop MapReduce routines.

6. Screen Hadoop cluster job performances and capacity planning

7. Monitor Hadoop cluster connectivity and security

8. Manage and review Hadoop log files.

9. File system management and monitoring.

10. HDFS support and maintenance.

11. Diligently teaming with the infrastructure, network, database,
application and business intelligence teams to guarantee high data quality
and availability.

12. Collaborating with application teams to install operating system and
Hadoop updates, patches, version upgrades when required.

13. Database backup and recovery.

14. Database connectivity and security.

15. Performance monitoring and tuning.


Hadoop Administrator Skills:
1. General operational expertise such as good troubleshooting skills,
understanding of system?s capacity, bottlenecks, basics of memory, CPU, OS,
storage, and networks.

2. Hadoop skills like HDFS, YARN + MapReduce2, Hive, HBase, Pig, Sqoop,
Oozie, ZooKeeper, Flume, Ambari, Kafka, Knox, Slider, Solr, Spark etc.

3. The most essential requirements are: They should be able to deploy Hadoop
cluster, add and remove nodes, keep track of jobs, monitor critical parts of
the cluster,

4. configure name-node high availability, schedule and configure it and take
backups.

5. Good knowledge of Linux as Hadoop runs on Linux.

6. Familiarity with open source configuration management and deployment
tools such as Ambari and Linux scripting.

7. Knowledge of Troubleshooting Core Java Applications is a plus.

Comments