Hadoop for Administrator
Hadoop - Admin
- Set up a local CDH repository
- Perform OS-level configuration for Hadoop installation
- Install Cloudera Manager server and agents
- Install CDH using Cloudera Manager
- Add a new node to an existing cluster
- Add a service using Cloudera Manager
Manage
Maintain and modify the cluster to support day-to-day operations in the enterprise
- Rebalance the cluster
- Set up alerting for excessive disk fill
- Define and install a rack topology script
- Install a new type of I/O compression library in the cluster
- Revise YARN resource assignment based on user feedback
- Commission/decommission a node
Test
Benchmark the cluster operational metrics, test system configuration for operation and efficiency.
- Execute file system commands via HTTPS
- Efficiently copy data within a cluster/between clusters
- Create/restore a snapshot of an HDFS directory
- Get/set ACLs for a file or directory structure
- Benchmark the cluster (I/O, CPU, network)
Configure
Perform basic and advanced configuration needed to effectively administer a Hadoop cluster.
- Configure a service using Cloudera Manager
- Create an HDFS user’s home directory
- Configure NameNode HA
- Configure Resource Manager HA
- Configure a proxy for Hiveserver2/Impala
Secure
Enable relevant services and configure the cluster to meet goals defined by security policy; demonstrate knowledge of basic security practices
- Configure HDFS ACLs
- Install and configure Sentry
- Configure Hue user authorization and authentication
- Enable/configure log and query redaction
- Create encrypted zones in HDFS
Troubleshoot
Demonstrate the ability to find the root cause of a problem, optimize inefficient execution, and resolve resource contention scenarios.
- Resolve errors/warnings in Cloudera Manager
- Resolve performance problems/errors in cluster operation
- Determine the reason for application failure
- Configure the Fair Scheduler to resolve application delays
Ready to get started?
Get in touch, or to apply for demo class
Duratech Solutions
Duratech Solutions is incorporated in 2012 and has successfully operated in the global software development industry for 7 Years.
We are the leaders in Coimbatore offering Trainings in Bigdata and Data Science, we are the only training provider in Coimbatore offering Deep Learning, the highest level of Machine Learning & Artificial Intelligence Technology. Our students have got placed in various companies like IBM, Sonata Software, Deloitte, etc
Reach Us
320N,Arpee Complex, NSR Road, SaiBaba Colony, Coimbatore-641 011. Tamil Nadu, India
256, 2nd Floor Sathy Rd,DPK Complex,Sathy Main Road,Opp. to Perumal Kovil,Saravanampatti, Coimbatore, Tamil Nadu - 641035