HADOOP CLOUDERA ADMINISTRATOR

7 days left

Location
Virginia Beach, VA
Posted
May 04, 2017
Closes
Jul 28, 2017
Function
Administrative
Industry
Healthcare
Hours
Full Time
Job Description:
Job Duties

• Stewardship the overall Cloudera architecture and infrastructure. Responsible for of sizing, scaling and supporting 40+ nodes in a new implementation.
• Working with data application teams and customers to setup new Hadoop users, Linux users, Kerberos principals and LDAP integration and verification of HDFS, Hive, Pig and MapReduce access for the new users
• Cluster maintenance as well as the creation and removal of nodes using tools like Ganglia, Nagios, or Cloudera Manager Enterprise.
• Performance tuning and troubleshooting of Hadoop clusters and Hadoop MapReduce routines
• Performance tuning and troubleshooting of Spark/SQL performance when using Impala, SAP Vora or similar SQL/Spark engine.
• Mentoring and maintain Hadoop cluster job performances and capacity planning
• Monitor and maintain Hadoop cluster connectivity, security and log files
• Monitor and maintain File system and HDFS
• Ability to work with the storage, network, database, and business intelligence teams to provide data quality and high availability
• Coordinate with Unix team and vendors to install operating system updates, Hadoop updates, patches, version upgrades when required
• General DBA responsibilities such as database backup and recovery, database connectivity and security, automation of tasks, and database provisioning.
• Participate in call rotation for all products supported by the BI systems architecture team

Sentara's Business Intelligence (BI) team seeks to advance and promote the effective use of information to support Sentara's business strategy. The Business Intelligence Architect has overall responsibility and accountability for designing a holistic BI environment from ETL (extract, transform, load) through to the End User applications. Leverages foundational skills in enterprise data architecture combined with prior experience to lead enterprise strategic engagements that focus on BI, Master Data Management, Data Governance, Data Quality, Predictive Analytics and Data Warehousing. This position will lead the process of researching emerging products and technologies, evaluating their promise and readiness for use at Sentara, and working with IT partners to study business value and conduct technical tests and prototypes. Assists the Manager of Applications and Technology Architecture in providing technical direction to Sentara to ensure the highest business value of the BI development efforts, and to recommend infrastructure and methodology changes to ensure the continuing viability and contribution of our enterprise BI applications.

Education Level
Bachelor's Level Degree

Experience
Required: Business Intelligence - 10 years

Preferred: None, unless noted in the “Other” section below

License
None, unless noted in the “Other” section below

Skills
Required:

Preferred: None, unless noted in the “Other” section below

Other
Bachelor's level degree in Computer Science, Information Technology or related field. 10 years of Business Intelligence, Data Warehouse or Analytics experience required. Microsoft Certified Architect or Microsoft Certified Master: Business Intelligence Developer on Microsoft SQL Server or equivalent is preferred.

• 3 to 5 years of Hadoop administration experience, preferably using Cloudera
• 3+ years of experience on Linux, preferably RedHat/SUSE
• 1+ years of experience creating map reduce jobs and ETL jobs in Hadoop, preferably using Cloudera
• Healthcare industry experience preferred

• Experience with Hadoop architecture, administration and database management
• Experience maintaining Hadoop clusters.
• Experience sizing and scaling clusters, adding and removing nodes, provisioning resources for jobs, job maintenance and scheduling
• Strong understanding of JVM debugging, management and tuning
• Operational experience with Hadoop technologies include Ambari, Ranger, Atlas, Knox, NiFi, Kafka, Storm, Hive, Pig, MapReduce, HDFS, HBase, Accumulo, Spark
• Experience with relational databases (preferably Microsft SQL Server) and data warehouse concepts
• Experience with IBM MDM Infosphere/Datastage in a Hadoop environment.
• Familiarity with Java programming
• Familiarity with Tableau, SAP HANA or SAP BusinessObjects