EdTech Jobs
Elsevier

Senior Data Scientist I

Elsevier
🇺🇸In-Person - 2 Locations$165K–$220K/yri2h ago
Prep for this Role

Role Snapshot

Senior Data Scientist role focused on developing and deploying cutting-edge Generative AI, NLP, and machine learning solutions for Elsevier's life sciences products. The position involves leading the full lifecycle of data science projects from design through production while mentoring junior team members.

Key Responsibilities: Develop, test, and maintain production-ready Gen AI, RAG, and NLP solutions in Python, including data collection, model development, and quality assessment. Collaborate with software engineers to deploy pipelines, optimize RAG systems, preprocess multilingual data, and establish monitoring and retraining strategies for model performance.
Skills & Tools: Expert proficiency in Python, Natural Language Processing, Machine Learning, Transformer models, and Generative AI with experience in RAG pipelines, data engineering, and MLOps. Strong communication and leadership abilities, with experience mentoring junior data scientists and collaborating across cross-functional teams.
Qualifications: Master's or Ph.D. in Computer Science, Data Science, Artificial Intelligence, or related field with 5+ years of applied experience in Generative AI, NLP, and machine learning. Demonstrated expertise in production-level machine learning systems and knowledge of latest AI/ML advancements.
Location: In-Person - 2 Locations
Compensation: $165K–$220K/yr (estimated)

Job Description

Are you interested in working with data and analytics to solve problems?
 Are you interested in bringing your GenAI, ML and NLP expertise to projects?
 About our Team Data Science Life Sciences is a diverse team focusing on GenAI, ML, NLP. We mainly develop best-in-class enrichment pipelines for Elsevier’s life science .com products such as Reaxys, Embase and Pharmapendium. About the Role As a Senior Data Scientist, you will play a pivotal role in the development and deployment of cutting-edge Gen AI models and solutions. You will be responsible for building, testing, and maintaining our Gen AI, RAG and NLP solutions You will work throughout the whole life cycle of data science projects: design, implementation, production and beyond. You will deliver efficient and production-ready Python code. You will collaborate closely with developers to deploy and productionize our data science pipelines and with subject matter experts in biology and chemistry domains to validate the output. This role requires a strong foundation in Natural Language Processing (NLP), Machine Learning, Transformer models and Generative AI, as well as proficiency in Python. Responsibilities Data collection, data analysis, model development, defining quality metrics, quality assessment of models and regular presentations to stakeholders. Creating production-ready Python packages for each component of data science pipelines (such as pre-processing and model inference) and their deployment together with software engineering team Optimizing and customizing Retrieval Augmented Generation (RAG) pipelines to meet specific project requirements that involve content ingestion, machine translation, and contextualized information retrieval Ingesting, preprocessing, and transforming large-scale multilingual data to ensure high-quality inputs for downstream models. Building AI agentic models integrated with RAG pipelines. Conducting rigorous testing and evaluation of AI models to ensure high performance and reliability. Integrating data science components and performing end-to-end quality assessments. Maintaining robustness of data science pipelines against model drift and ensuring consistent output quality. Establishing reporting processes for pipeline performance and developing automated re-training strategies for existing pipelines. Collaborating with cross-functional teams to integrate AI solutions into existing products and services. Leading and managing projects with a team of data scientists and independently executing the entire small-scale projects Mentoring junior data scientists and fostering a knowledge-sharing culture within the team. Staying up-to-date with the latest advancements in AI, machine learning, and NLP technologies. 
 Requirements
 Master’s or Ph.D. in Computer Science, Data Science, Artificial Intelligence, or a related field. 5+ years of relevant applied experience in data science, with a focus on Generative AI, NLP, and machine learning. Proficiency in Python for data analysis, model development, and deployment. Strong experience with transformer models Proficiency in Generative AI technologies, including utilizing LLMs via API access, LLM evaluation tools, and prompt engineering. Knowledge of various RAG pipelines and their practical implementation. Experience building Agentic RAG systems is strong requirement. Experience with AI agent management frameworks such as LangChain, or similar tools. Experience with advanced algorithms in deep learning, neural networks, reinforcement learning, and transfer learning. Familiarity with traditional machine learning algorithms such as random forests, SVM, logistic regression, and Bayesian modelling for model building, validation, and testing. Familiarity with cloud platforms (e.g., Bedrock, AWS, Azure) for model deployment and the creation of production-ready pipelines. Proficiency in data visualization tools and techniques. Experience with version control systems (e.g., GitLab or GitHub), Jira, and working in an Agile environment. Proficient in using OpenSearch and Databricks. Excellent problem-solving and analytical skills, with strong attention to detail. Strong communication skills and the ability to work effectively in a team-oriented environment. Work in a way that works for you We promote a healthy work/life balance across the organization. We offer an appealing working prospect for our people. With numerous wellbeing initiatives, shared parental leave, study assistance and sabbaticals, we will help you meet your immediate responsibilities and your long-term goals. Flexible working hours - flexing the times when you work in the day to help you fit everything in and work when you are the most productive. About the business As a global leader in information and analytics, we help researchers and healthcare professionals advance science and improve health outcomes for the benefit of society. Building on our publishing heritage, we combine quality information and vast data sets with analytics to support visionary science and research, health education, and interactive learning, as well as exceptional healthcare and clinical practice. At Elsevier, your work contributes to the world’s grand challenges and a more sustainable future. We harness innovative technologies to support science and healthcare to partner for a better world.

Primary Location Base Pay Range: NLD Amsterdam (Radarweg) - €1,000,000. This role is covered by the Collective Labor Agreement Publishing Industry. We know your well-being and happiness are key to a long and successful career. We are delighted to offer country specific benefits. Click here to access benefits specific to your location. We are committed to providing a fair and accessible hiring process. If you have a disability or other need that requires accommodation or adjustment, please let us know by completing our Applicant Request Support Form or please contact 1-855-833-5120. Criminals may pose as recruiters asking for money or personal information. We never request money or banking details from job applicants. Learn more about spotting and avoiding scams here. Please read our Candidate Privacy Policy.

We are an equal opportunity employer: qualified applicants are considered for and treated during employment without regard to race, color, creed, religion, sex, national origin, citizenship status, disability status, protected veteran status, age, marital status, sexual orientation, gender identity, genetic information, or any other characteristic protected by law.

USA Job Seekers: EEO Know Your Rights. Elsevier is a global leader in advanced information and decision support for science and healthcare. We believe that by working together with the communities we serve, we can shape human progress to go further, happen faster, and benefit all. We support continuous discovery and uphold the highest standards of content integrity, reliability, and reproducibility so the communities we serve can advance their field of science, healthcare or innovation with confidence. By combining high-quality content with powerful analytics, we transform complexity into clarity and deliver mission-critical insights that help professionals make better decisions when it matters most. We deliver insights that help research institutions, governments, and funders achieve their goals. We help researchers discover and share knowledge, collaborate, and accelerate innovation. We help librarians provide verified, quality information to universities. We help innovators turn knowledge into new products. We help health professionals improve patient care and educators train the next generation of doctors and nurses. Connecting quality content and innovative technologies, we make progress go further and happen faster. And by championing inclusion and sustainability, we ensure progress benefits all. With 9,500 employees, over 2,300 technologists in 5 major tech hubs, and more than 60 locations across the globe, we are committed to supporting the scientific and healthcare communities around the world. We offer a diverse range of opportunities across technology, commercial, business, and early career jobs. If you are looking for a career that inspires progress in science, innovation and health, and allows you to grow every day, find your team at Elsevier. Elsevier is part of RELX Group. Let’s shape progress together. Join us. elsevier.com/about/careers