Data Scientist (Text mining: understanding impact)

European Molecular Biology Laboratory (EMBL) - Literature Services Team

Contract Duration: 2 years

For more information about pay and benefits click here

Job Description
We are seeking to recruit a data scientist with text mining skills to join the Literature Services Team at the European Bioinformatics Institute (EMBL-EBI) located on the Wellcome Trust Genome Campus near Cambridge in the UK. This post is a fixed-term post to undertake text and data mining projects that support investigations into the impact of funded research and research data infrastructure.

Europe PMC is the database of life sciences abstracts and full text articles that incorporates both PubMed and PMC content, holding over 30 million abstracts and 4.2 million full text articles), and is supported by 27 funders of life sciences research, for whom we also run a public database of awarded grants. In addition to providing powerful search and retrieval mechanisms for the content such as section-level searching, we integrate the articles with ORCIDs, supporting data, funding information and other resources that provide relevant information for our users. This represents a large collection of material for data mining on which to explore the impacts of research funding and use of research data infrastructure. We are therefore looking for a versatile data scientist capable of developing production-quality text and data mining algorithms to support impact analysis.

Specific job responsibilities include:

  • Develop algorithms that mine full text research papers for organisation names and grant IDs for Europe PMC funders
  • Extend algorithms for mining database accession numbers, resource names or other means of gathering indicators of use of data resources from the literature
  • Iterative improvement of solutions, with key stakeholders, and analysis of results
  • Development of interfaces that support easy access to the results of this work

At EMBL-EBI, we help scientists realise the potential of ‘big data’ in biology by enabling them to exploit complex information to make discoveries that benefit mankind. Working for EMBL-EBI gives you an opportunity to apply your skills and energy for the greater good.

Qualifications and Experience
The successful candidate should demonstrate some or most of the following:

  • Experience of text-mining as applied to biological data resources in an academic, industrial or publishing setting;
  • Technical ability e.g. Perl, Java, R, XML parsing;
  • Flexible approach and ability to take on new skills;
  • Self starter and able to manage multiple projects;
  • Team player and good communicator

EMBL is an inclusive, equal opportunity employer offering attractive conditions and benefits appropriate to an international research organisation. The remuneration package comprises a competitive salary, a comprehensive pension scheme and health insurance, educational and other family related benefits where applicable, as well as financial support for relocation and installation.

Application Instructions
To apply please submit a covering letter and CV, with two referees, through our online system.

Additional Information
Applications are welcome from all nationalities - visa information will be discussed in more depth with applicants selected for interview.

EMBL-EBI is committed to achieving gender balance and strongly encourages applications from women, who are currently under-represented at all levels. Appointment will be based on merit alone.

This position is limited to the project duration specified.

Share this job
  Share by Email   Print this job   More sharing options
We value your feedback on the quality of our adverts. If you have a comment to make about the overall quality of this advert, or its categorisation then please send us your feedback
Advert information

Subject Area(s):



South East England