Senior Data Scientist
Engage with internal teams such as Newsroom and Advertising, to understand IT and software needs and challenges. Use data science to solve these problems. Design and develop a Spark Streaming framework to automatically collect temporary, social and other signals on all Washington Post news articles in real time. Implement and train a novel two-layer hybrid regression model with various regression algorithms to predict popularity of news articles. Setup a GPU system from scratch and train a state-of-the-art deep learning recurrent neural network model with millions of Washington Post news articles. Design and develop a suggestions system incorporating multiple headline generation algorithms including the deep learning model. Develop a novel support vector machine based statistical model to automatically identify breaking news stories on the same events. Design and implement an automatic evaluation system incorporating the statistical model to measure breaking news services of multiple news sources. Lead the Washington Post's collaborations with research teams of U.S. universities. Publish and present novel approaches at research and industry conferences.
Bachelor's in Computer Science, Statistics or Mathematics plus 5 years of experience in the job offered or as a Software Development or Systems Engineer OR a Master's in Computer Science, Statistics or Mathematics plus 3 years of experience in the job offered or as a Software Development or Systems Engineer. Also required: 3 years of experience in Java programming; 2 years of experience with Perl, Python, Hive, or Pig; and 1 year of experience with R, Weka, SPSS, or SAS; natural language processing; and predictive modeling. All experience may be concurrent. To apply, send resume and cover letter to Thomas Grady, Attn: SDS, WP Company, LLC d/b/a The Washington Post, 1301 K Street, NW, Washington, DC 20071.