Java Hadoop Developer
Lead Developer Location Reston, VA Duration 12 months Description This position requires a BABS in Computer Science, Information Systems, Information Technology or related field with 10 years of prior experience in software development, Data Engineering and Business Intelligence OR equivalent experience. Following are the some of the key skills that you must have. - Proven experience designing, building and optimizing data pipelines to ingestprocess structured and unstructured datasets - Advanced Level experience (7 years ) with Java , PythonScala programming languages - Advanced level Experience (3 years ) building Real Time streaming solutions, leveraging Flume, Kafka and Apache Spark streaming - 2 years in Hadoop technologies such as MapReduce, Hive, Sqoop, Oozie, Impala and other related Big Data technologies - Experience tuning various configuration parameters for optimal performance - Advanced level experience with at least one NoSQL stores (Hbase, Cassandra, MongoDB etc) Hbase is highly preferred - Prior experience leading Big Data projects with a minimum team size of 4 developers - Must have extensive experience with Agile or Scrum methodologies. - Must have strong experience in continuous integration within DevOps environment. - Strong communication skills and self-starter mentality and the ability to think outside the box. - Rigor in high code quality, automated testing, and other engineering best practices, ability to write reusable code components Skills nice to have - Healthcare experience - Experience in WebServiceAPI integration, providing Data as a Service - Cloudera Certified Developer - Work with Data Analysts and other team members to review business requirements and translate into technical requirements - Collaborate with application architects and data solution architects - Design and Build Data Integration pipeline using Cloudera Hadoop platform - Guide other team members , resolve technical impediments and ensure code quality and standards - Transform the data to create a consumable data layer for various application uses - Support Data pipeline with bug fixes, and additional enhancements - Document Technical design , Operational Runbook etc.