Data Engineer

Reston, VA
Nov 09, 2018
Nov 12, 2018
Full Time
Overview Jellyfish, delivers digital marketing solutions across the world for a host of impressive brands from their offices in the US, Europe, and South Africa. The agency's award-winning combination of technology and talent help it to deliver paid media, UX, SEO, social media, analytics, optimization, creative and development solutions. We at Jellyfish appreciate the value of data within ours and client's business. We generate more data than ever before but we recognise that clean and meaningful data streamed at a rate which can give us a competitive edge is the key. Our Data Engineers are key to activating our innovative solutions and the role requires an individual who can work with other analysts to create solutions which integrate with our advertising technology stack. Responsibilities Responsible for all extract, transform and load (ETL) processes and the creation of applications that can connect to remote APIs. Preferably including DoubleClick, Google Analytics, AdWords and Google Tag-Manager and stream data into environments such as BigQuery Responsible for the management of multiple processes and applications, performance reporting and error checking Responsible for the management of all data created within client applications, the structure of data held and the views of data created Responsible for recommending the correct technologies to be used and in the most cost effective manner Responsible for the design and creation of data led strategies which provide clients with opportunities to leverage their data for greater insight or performance Provide thought leadership with regards to best practice and use of the google cloud platform Qualifications BS Degree Data Engineering/ BI Development/ Data Warehousing experience. Knowledge of server-less infrastructure beneficial Ability to scope a project based on a technical brief and work with the DevOps and QA teams to provide a detailed project plan including: Data Flow Diagrams for process flow Database Schemas & Normalisation Recommended software / plugins / architecture Scalable environment architecture suggestions Hosting, storage, load balancing and caching suggestions Performance considerations Security considerations Assumptions & Exclusions A complete and accurate estimate for the project Ability to assess new business and respond with a full list of targeted questions to ensure accurate estimates are created Ability to research solutions to technical problems Experience scheduling/automating scripts Experience with streaming data beneficial Experience on Linux command line and Bash scripting Experience with Git/GitHub Experience with Amazon/Google Cloud services. Experience with Dataflow, Google PubSub or other queuing software beneficial Good experience of parsing data formats such as XML/JSON and using 3rd party API's Experience with Curl / similar beneficial Solid Python programming skills. Java / other languages beneficial Strong SQL experience Experience in using Key/Value or Document Stores such as DocumentDB, BigTable, NoSQL, MongoDB, Hadoop Basic experience with Tensorflow. CloudML/Spark/SparkML Beneficial An understanding of how data can benefit the wider business, and how to translate technical requirements to non-technical stakeholders Must have experience of building robust pipelines and data environments to support database or machine learning based applications

Similar jobs

More searches like this