- Full Time
The Royal Society of Chemistry (RSC) seeks a Data Scientist to join us on a permanent full-time (35-hour week) basis. You can be based either at our Cambridge office or, if suitable, at home within the UK, in line with Covid-19 guidance.
The Royal Society of Chemistry's Data Science team plays an important role in our mission to shape the future of the chemical sciences – for the benefit of science and humanity. The team use their knowledge to help produce reports on topics such as inclusion and diversity in the chemical sciences, to help prepare briefs to guide government policy on research funding and education, and to take part in international competitions to improve the state of the art in machine learning. They focus on projects that make a measurable difference.
The Data Scientist employs techniques and theories drawn from many fields within the broad areas of mathematics, statistics, and computer science. Specifically, the role combines domain knowledge with corporate data to drive insights and find strategic opportunities for the Royal Society of Chemistry.
As a Data Scientist for the Royal Society of Chemistry, you will have expertise in machine learning, applied statistics, and programming, together with domain knowledge. The role is highly technical and hands-on. It involves running projects that mine complex data sets to discover knowledge valuable to the business, and it requires a level of project management experience.
Your core responsibilities will include:
- Advise on and investigate the feasibility of data science solutions and ideas.
- Conduct hands-on machine learning and applied statistics projects.
- Collaborate with a diverse set of users and stakeholders with different levels of statistical knowledge, adapting approaches as new insights emerge from the data.
- Clearly and objectively communicate results, as well as their associated uncertainties and limitations.
- Present complex findings in an intuitive and understandable manner.
- Extract meaning from data and produce data products that convert raw data into a strategic asset, for both chemical science data and corporate data.
We are looking for:
- Qualification in machine learning, natural language processing, statistics, or another numerate discipline. Higher degree required (Ph.D. preferred).
- Comprehensive expertise in data science, preferably with a science focus.
- Experience of applied statistics and machine learning.
- Experience in handling large datasets.
- Skilled in R (including the Tidyverse, Shiny and commonly used statistical modelling/analysis techniques) and/or Python (including NumPy, SciPy and Pandas).
- Skilled in database technologies such as SQL and MongoDB.
- Expertise in Cheminformatics/Chemistry data and handling large datasets desirable.
- Natural language processing and text mining highly desirable.
- Experience of different software development methodologies including Agile techniques and continuous delivery.
- Well-organised, self-motivated team player; aptitude for project management.
- Able to clearly communicate technical concepts to a non-technical audience.
- Negotiating skills to quickly resolve customer issues.
- A drive to learn and master new technologies and techniques.
At the Royal Society of Chemistry, we support development and offer excellent benefits.
If you are interested in this opportunity, please apply by the closing date.