Job opening detail :
Machine Learning Engineer / Data Scientist - Penguin Random House
posted: June 6, 2019
Offered by:
Penguin Random House LLC
Health; Dental; 401K
Full Time
New York, NY
The Data Science & Analytics group at Penguin Random House is seeking a Machine Learning Engineer or a Data Scientist.

We are an agile team of data scientists and software engineers with a wide mandate encompassing pricing systems, recommendation and personalization systems, title segmentation, supply chain, as well as data exploration and research applying novel statistical methods.

In this role, you will have an opportunity to work on a variety of high-profile projects under the mentorship of Senior Data Scientists and in collaboration with key decision makers across the organization.

Your profile:
* A bachelor's degree in mathematics, statistics, economics, computer science, business analytics, or any quantitative social science
* Relevant coursework applying advanced statistical/machine learning and predictive analysis techniques
* 2 years of professional experience in a data science role
* Intuition for mapping real world problems to relevant analytical methods, models, approaches
* Expertise in writing and maintaining stable production level code in Python (e.g., for automating data pipeline/modeling tasks)
* Solid capability in SQL for tasks such as computing aggregates and joining multiple tables
* A strong, documented desire to rapidly and continually advance skills through on-the-job and off-the-job training (e.g. via MOOCs)

Distinguishing Experience:
* Experience working with Python packages such as scikit-learn, statsmodels, pandas, or TensorFlow
* Alternatively, a good understanding of R packages such as ggplot2, rCharts, ri, dplyr, data.table, cvTools, (b)lmer, arm, lasso/glmnet, BayesTree and reshape2/tidyr
* Experience with Stan or other general-purpose modeling tools
* Experience working with cloud-based computing platforms (e.g. AWS, Google Cloud Platform)
* Experience extracting data from and building/maintaining APIs
* Experience with UX design and data visualization
* Experience building data products from the warehouse ingestion phase all the way through to the business-facing application side
* Experience with automated feature engineering and large datasets (>1TB)

Please include with your application a link to your GitHub (Bitbucket) repository for a code sample, whether it was for a Kaggle attempt, a school project, or a general open-source contribution. Standalone code samples will also be accepted.

Please apply using our online application process, and please include your résumé and cover letter with salary requirements. Full-time employees are eligible for our comprehensive benefits program.
About Our  
Penguin Random House is the leading adult and children's publishing house in North America, the United Kingdom and many other regions around the world. In publishing the best books in every genre and subject for all ages, we are committed to quality, excellence in execution, and innovation throughout the entire publishing process: editorial, design, marketing, publicity, sales, production, and distribution. Our vibrant and diverse international community of nearly 250 publishing brands and imprints include Ballantine Bantam Dell, Berkley, Clarkson Potter, Crown, DK, Doubleday, Dutton, Grosset & Dunlap, Little Golden Books, Knopf, Modern Library, Pantheon, Penguin Books, Penguin Press, Penguin Random House Audio, Penguin Young Readers, Portfolio, Puffin, Putnam, Random House, Random House Children's Books, Riverhead, Ten Speed Press, Viking, and Vintage, among others. More information can be found at
Penguin Random House values the array of talents and perspectives that a diverse workforce brings. All qualified applicants will receive consideration for employment without regard to race, national origin, religion, age, color, sex, sexual orientation, gender identity, disability, or protected veteran status.
Job #:
<< Job Board
home  |  contact us  |  FAQ  |  site guide  |  help