Big Data Architect

GreenZone Solutions, Inc.
Washington, DC
Apr 20, 2017
Apr 21, 2017
IT, IT Architect
Full Time
GreenZone Solutions, Inc. has a need for a Big Data Architect in support of our Federal Client. The Big Data Architect will be responsible for the architecture design, data process workflow, and implementation of our Big Data platform. The Architect will be expected to lead and mentor data engineers in building a big data processing solution.

Responsibilities:
- Perform architecture design, data modeling, and implementation of our Big Data platform and analytic applications for CFPB financial data.
- Apply deep learning capability to improve understanding of user behavior and data.
- Develop a highly scalable and extensible Big Data platform that enables collection, storage, modeling, and analysis of massive data sets from numerous channels.
- Define Hadoop architectures and recommend solutions to meet business requirements.
- Demonstrate deep knowledge of Hadoop development and implementation.
- Pre-process data using Hive and Pig.
- Leverage the Spark API to recommend real-time data processing solutions.
- Design, build, install, configure, and support Hadoop clusters.
- Translate complex functional and technical requirements into detailed designs.
- Perform analysis of vast data stores and uncover insights.
- Maintain security and data privacy.
- Manage and deploy HBase and/or other NoSQL databases.
- Propose best practices and standards.
- Mentor data engineers on the Hadoop ecosystem toolset.

Desired Skills/Requirements:
3+ years of experience in the following areas:
- BA/BS in a related field
- Database design and large (terabyte-scale) database architecture
- Working with massive amounts of data in a high-availability environment
- Experience configuring and administering Hadoop, Spark, MapReduce frameworks, and NoSQL databases such as MongoDB
- Knowledge of Massively Parallel Processing databases such as Greenplum or Redshift
- Unit testing as well as black-box and white-box testing
- PostgreSQL and SQL Server development experience, including writing and optimizing SQL queries using T-SQL and PL/pgSQL
- Experience in database optimization, performance tuning, health monitoring, administration, etc.
- Experience working in a Linux environment
- Experience using Git version control
- Scripting in Python or Java
- Communication skills and the ability to work with customers, senior management, and other technical teams
- Excellent documentation skills and the ability to recommend best practices and articulate process improvements and required changes
- Organized, with the ability to meet production deadlines
- Must be a US Citizen with the ability to obtain and maintain a Government Clearance

Company Description:
GreenZone Solutions, Inc. delivers innovative, cutting-edge technology and management solutions to meet our federal and private sector clients' needs. As a fast-growing company that is looking to recruit and invest in tomorrow's leaders and technical innovators, we are relentless in our pursuit of individuals who maintain high ethical standards, believe in their community, and expect excellence from themselves and those around them. Our firm offers specialized proficiency and comprehensive experience in Big Data Solutions, Business Intelligence, and Data Analytics to deliver quantifiable, lasting solutions to our clients' most complex problems. GreenZone Solutions, Inc. is an SBA-certified 8(a) small disadvantaged business and Woman-Owned Small Business (WOSB) headquartered in Arlington, Virginia.