Published paper: benchmarking resistance gene identification

Since F1000Research uses a somewhat different publication scheme than most journals, I still haven’t fully understood whether this paper is formally published after peer review, but I am starting to assume it is. There have been very few changes since the last version, so I will be lazy and basically repost what I wrote in April when the first version (the “preprint”) was posted online. The paper (1) is the result of a workshop arranged by the JRC in Italy in 2017. It describes various challenges arising from the process of designing a benchmark strategy for bioinformatics pipelines in the identification of antimicrobial resistance genes in next generation sequencing data.

The paper discusses issues related to the benchmarking datasets used, the testing samples, the evaluation criteria for the performance of different tools, and how the benchmarking dataset should be created and distributed. Specifically, we address the following questions:

  • How should a benchmark strategy handle the current and expanding universe of NGS platforms?
  • What should be the quality profile (in terms of read length, error rate, etc.) of in silico reference materials?
  • Should different sets of reference materials be produced for each platform? In that case, how to ensure no bias is introduced in the process?
  • Should in silico reference material be composed of the output of real experiments, or simulated read sets? If a combination is used, what is the optimal ratio?
  • How is it possible to ensure that the simulated output has been simulated “correctly”?
  • For real experiment datasets, how to avoid the presence of sensitive information?
  • Regarding the quality metrics in the benchmark datasets (e.g. error rate, read quality), should these values be fixed for all datasets, or fall within specific ranges? How wide can/should these ranges be?
  • How should the benchmark manage the different mechanisms by which bacteria acquire resistance?
  • What is the set of resistance genes/mechanisms that need to be included in the benchmark? How should this set be agreed upon?
  • Should datasets representing different sample types (e.g. isolated clones, environmental samples) be included in the same benchmark?
  • Is a correct representation of different bacterial species (host genomes) important?
  • How can the “true” value of the samples, against which the pipelines will be evaluated, be guaranteed?
  • What is needed to demonstrate that the original sample has been correctly characterised, in case real experiments are used?
  • How should the target performance thresholds (e.g. specificity, sensitivity, accuracy) for the benchmark suite be set? (See the sketch after this list.)
  • What is the impact of these performance thresholds on the required size of the sample set?
  • How can the benchmark stay relevant when new resistance mechanisms are regularly characterized?
  • How is the continued quality of the benchmark dataset ensured?
  • Who should generate the benchmark resource?
  • How can the benchmark resource be efficiently shared?
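
To make the performance-threshold question a bit more concrete, here is a minimal Python sketch of how sensitivity, specificity and accuracy could be computed for a single benchmark sample with a known truth set of resistance genes. The gene names and sets are made up for illustration; a real benchmark would of course define them from the characterised reference material.

```python
# Minimal sketch: performance metrics for one benchmark sample.
# All gene names/sets below are hypothetical illustrations.

truth = {"blaCTX-M-15", "tetA", "sul1"}        # genes truly present in the sample
predicted = {"blaCTX-M-15", "tetA", "aadA1"}   # genes reported by a pipeline
screened = {"blaCTX-M-15", "tetA", "sul1",
            "aadA1", "vanA", "mecA"}           # all genes the benchmark tests for

tp = len(truth & predicted)             # true positives: correctly detected
fp = len(predicted - truth)             # false positives: wrongly reported
fn = len(truth - predicted)             # false negatives: missed genes
tn = len(screened - truth - predicted)  # true negatives: correctly absent

sensitivity = tp / (tp + fn)
specificity = tn / (tn + fp)
accuracy = (tp + tn) / len(screened)

print(f"sensitivity={sensitivity:.2f}, specificity={specificity:.2f}, accuracy={accuracy:.2f}")
```

Computing the numbers is the easy part; deciding which thresholds they must clear, and on how many samples, is exactly the kind of question the paper leaves open.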

Of course, we have not answered all these questions, but I think we have arrived at a decent description of the problems, which we see as an important foundation for solving these issues and implementing a benchmarking standard. Some of these issues were tackled in our review paper from last year on using metagenomics to study resistance genes in microbial communities (2). The paper also connects somewhat to the database curation paper we published in 2016 (3), although this time the strategies deal with the testing datasets rather than the actual databases. The paper is the first outcome of the workshop arranged by the JRC on “Next-generation sequencing technologies and antimicrobial resistance”, held on October 4-5, 2017, in Ispra, Italy. You can find the paper here (it’s open access).

On another note, the new paper describing the UNITE database (4) has now got a formal issue assigned to it, as has the paper on tandem repeat barcoding in fungi published in Molecular Ecology Resources last year (5).

References and notes

  1. Angers-Loustau A, Petrillo M, Bengtsson-Palme J, Berendonk T, Blais B, Chan KG, Coque TM, Hammer P, Heß S, Kagkli DM, Krumbiegel C, Lanza VF, Madec J-Y, Naas T, O’Grady J, Paracchini V, Rossen JWA, Ruppé E, Vamathevan J, Venturi V, Van den Eede G: The challenges of designing a benchmark strategy for bioinformatics pipelines in the identification of antimicrobial resistance determinants using next generation sequencing technologies. F1000Research, 7, 459 (2018). doi: 10.12688/f1000research.14509.1
  2. Bengtsson-Palme J, Larsson DGJ, Kristiansson E: Using metagenomics to investigate human and environmental resistomes. Journal of Antimicrobial Chemotherapy, 72, 2690–2703 (2017). doi: 10.1093/jac/dkx199
  3. Bengtsson-Palme J, Boulund F, Edström R, Feizi A, Johnning A, Jonsson VA, Karlsson FH, Pal C, Pereira MB, Rehammar A, Sánchez J, Sanli K, Thorell K: Strategies to improve usability and preserve accuracy in biological sequence databases. Proteomics, 16, 18, 2454–2460 (2016). doi: 10.1002/pmic.201600034
  4. Nilsson RH, Larsson K-H, Taylor AFS, Bengtsson-Palme J, Jeppesen TS, Schigel D, Kennedy P, Picard K, Glöckner FO, Tedersoo L, Saar I, Kõljalg U, Abarenkov K: The UNITE database for molecular identification of fungi: handling dark taxa and parallel taxonomic classifications. Nucleic Acids Research, 47, D1, D259–D264 (2019). doi: 10.1093/nar/gky1022
  5. Wurzbacher C, Larsson E, Bengtsson-Palme J, Van den Wyngaert S, Svantesson S, Kristiansson E, Kagami M, Nilsson RH: Introducing ribosomal tandem repeat barcoding for fungi. Molecular Ecology Resources, 19, 1, 118–127 (2019). doi: 10.1111/1755-0998.12944

New preprint: benchmarking resistance gene identification

This weekend, F1000Research put online the non-peer-reviewed version of the paper resulting from a workshop arranged by the JRC in Italy last year (1). (I will refer to this as a preprint, but at F1000Research the line between a preprint and a published paper is quite blurry.) The paper describes various challenges arising from the process of designing a benchmark strategy for bioinformatics pipelines (2) in the identification of antimicrobial resistance genes in next generation sequencing data.

The paper discusses issues related to the benchmarking datasets used, the testing samples, the evaluation criteria for the performance of different tools, and how the benchmarking dataset should be created and distributed. Specifically, we address the following questions:

  • How should a benchmark strategy handle the current and expanding universe of NGS platforms?
  • What should be the quality profile (in terms of read length, error rate, etc.) of in silico reference materials? (See the toy example after this list.)
  • Should different sets of reference materials be produced for each platform? In that case, how to ensure no bias is introduced in the process?
  • Should in silico reference material be composed of the output of real experiments, or simulated read sets? If a combination is used, what is the optimal ratio?
  • How is it possible to ensure that the simulated output has been simulated “correctly”?
  • For real experiment datasets, how to avoid the presence of sensitive information?
  • Regarding the quality metrics in the benchmark datasets (e.g. error rate, read quality), should these values be fixed for all datasets, or fall within specific ranges? How wide can/should these ranges be?
  • How should the benchmark manage the different mechanisms by which bacteria acquire resistance?
  • What is the set of resistance genes/mechanisms that need to be included in the benchmark? How should this set be agreed upon?
  • Should datasets representing different sample types (e.g. isolated clones, environmental samples) be included in the same benchmark?
  • Is a correct representation of different bacterial species (host genomes) important?
  • How can the “true” value of the samples, against which the pipelines will be evaluated, be guaranteed?
  • What is needed to demonstrate that the original sample has been correctly characterised, in case real experiments are used?
  • How should the target performance thresholds (e.g. specificity, sensitivity, accuracy) for the benchmark suite be set?
  • What is the impact of these performance thresholds on the required size of the sample set?
  • How can the benchmark stay relevant when new resistance mechanisms are regularly characterized?
  • How is the continued quality of the benchmark dataset ensured?
  • Who should generate the benchmark resource?
  • How can the benchmark resource be efficiently shared?
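
As a toy illustration of the quality-profile and simulation questions above (read length, error rate, and how reads are generated), here is a small Python sketch that draws reads from a reference sequence and adds substitution errors at a fixed rate. This is purely illustrative; an actual benchmark would rely on dedicated, platform-calibrated read simulators (e.g. ART), and the parameter values here are arbitrary assumptions.

```python
# Toy read simulator: fixed read length and per-base substitution error
# rate. Purely illustrative; parameter values are arbitrary assumptions.
import random

def simulate_reads(reference, n_reads, read_length=150, error_rate=0.01):
    """Sample reads uniformly from 'reference' and add substitution errors."""
    bases = "ACGT"
    reads = []
    for _ in range(n_reads):
        start = random.randrange(len(reference) - read_length + 1)
        read = list(reference[start:start + read_length])
        for i, base in enumerate(read):
            if random.random() < error_rate:
                read[i] = random.choice([b for b in bases if b != base])
        reads.append("".join(read))
    return reads

# Mock 10 kb "genome" and 1,000 simulated reads from it
reference = "".join(random.choice("ACGT") for _ in range(10_000))
reads = simulate_reads(reference, n_reads=1000)
print(len(reads), reads[0][:30])
```

Even this toy version makes the point: every design choice (uniform sampling, substitution-only errors, a single fixed error rate) introduces a bias that a benchmark would have to justify and document.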

Of course, we have not answered all these questions, but I think we have arrived at a decent description of the problems, which we see as an important foundation for solving these issues and implementing a benchmarking standard. Some of these issues were tackled in our review paper from last year on using metagenomics to study resistance genes in microbial communities (3). The paper also connects somewhat to the database curation paper we published in 2016 (4), although this time the strategies deal with the testing datasets rather than the actual databases. The paper is the first outcome of the workshop arranged by the JRC on “Next-generation sequencing technologies and antimicrobial resistance”, held on October 4-5 last year in Ispra, Italy. You can find the paper here (it’s open access).

References and notes

  1. Angers-Loustau A, Petrillo M, Bengtsson-Palme J, Berendonk T, Blais B, Chan KG, Coque TM, Hammer P, Heß S, Kagkli DM, Krumbiegel C, Lanza VF, Madec J-Y, Naas T, O’Grady J, Paracchini V, Rossen JWA, Ruppé E, Vamathevan J, Venturi V, Van den Eede G: The challenges of designing a benchmark strategy for bioinformatics pipelines in the identification of antimicrobial resistance determinants using next generation sequencing technologies. F1000Research, 7, 459 (2018). doi: 10.12688/f1000research.14509.1
  2. You may remember that I hate the term “pipeline” for bioinformatics protocols. I would have preferred to call them workflows or something similar, but the term “pipeline” has taken hold, and I guess this is a battle I have essentially lost. Bioinformatics workflows will be known as pipelines, for better or worse.
  3. Bengtsson-Palme J, Larsson DGJ, Kristiansson E: Using metagenomics to investigate human and environmental resistomes. Journal of Antimicrobial Chemotherapy, 72, 2690–2703 (2017). doi: 10.1093/jac/dkx199
  4. Bengtsson-Palme J, Boulund F, Edström R, Feizi A, Johnning A, Jonsson VA, Karlsson FH, Pal C, Pereira MB, Rehammar A, Sánchez J, Sanli K, Thorell K: Strategies to improve usability and preserve accuracy in biological sequence databases. Proteomics, 16, 18, 2454–2460 (2016). doi: 10.1002/pmic.201600034

Save antibiotics for medicine – not as growth promoters

UPDATE: This post has been updated to address a valid comment about a mistake in the original post. The specific use of antibiotics as feed additives has been forbidden in the EU for years. The petition is about cutting the prophylactic use of antibiotics in animals, but that was very unclear from the original post. I thank my readers for pointing this out.

I’m going to do something unusual and ask you to sign a petition targeted at European Union ministers, in support of new EU laws to drastically cut the prophylactic use of antibiotics in agriculture. The problem is that if the ministers don’t feel public pressure to act, the laws may be delayed or never implemented. Examples from Denmark, Sweden, Norway and the Netherlands show that it is possible to produce meat with little or no antibiotics, but since bacteria can travel across borders (1), we need to bring the rest of the world onboard, and the EU is a good first step. Therefore, I ask you to sign the Avaaz petition here.

  1. Bengtsson-Palme J, Angelin M, Huss M, Kjellqvist S, Kristiansson E, Palmgren H, Larsson DGJ, Johansson A: The human gut microbiome as a transporter of antibiotic resistance genes between continents. Antimicrobial Agents and Chemotherapy, 59, 10, 6551-6560 (2015). doi: 10.1128/AAC.00933-15 [Paper link]

Published paper: Aquatic effect-based monitoring tools

A couple of days ago, a paper was published in Environmental Sciences Europe summarizing the EU report on effect-based tools for use in toxicology in the aquatic environment that I have been involved in (1). This report was officially published last spring (2) and can be found here, with the annex available on the European Commission document website. My contribution to the paper was, as with the report, in the genomics and metagenomics section. The paper briefly presents modern bioassays, biomarkers and ecological methods that can be used for monitoring the aquatic environment.

References:

  1. Wernersson A-S, Carere M, Maggi C, Tusil P, Soldan P, James A, Sanchez W, Dulio V, Broeg K, Reifferscheid G, Buchinger S, Maas H, Van Der Grinten E, O’Toole S, Ausili A, Manfra L, Marziali L, Polesello S, Lacchetti I, Mancini L, Lilja K, Linderoth M, Lundeberg T, Fjällborg B, Porsbring T, Larsson DGJ, Bengtsson-Palme J, Förlin L, Kienle C, Kunz P, Vermeirssen E, Werner I, Robinson CD, Lyons B, Katsiadaki I, Whalley C, den Haan K, Messiaen M, Clayton H, Lettieri T, Negrão Carvalho R, Gawlik BM, Hollert H, Di Paolo C, Brack W, Kammann U, Kase R: The European technical report on aquatic effect-based monitoring tools under the water framework directive. Environmental Sciences Europe, 27, 7 (2015). doi: 10.1186/s12302-015-0039-4 [Paper link]
  2. Wernersson A-S, Carere M, Maggi C, Tusil P, Soldan P, James A, Sanchez W, Broeg K, Kammann U, Reifferscheid G, Buchinger S, Maas H, Van Der Grinten E, Ausili A, Manfra L, Marziali L, Polesello S, Lacchetti I, Mancini L, Lilja K, Linderoth M, Lundeberg T, Fjällborg B, Porsbring T, Larsson DGJ, Bengtsson-Palme J, Förlin L, Kase R, Kienle C, Kunz P, Vermeirssen E, Werner I, Robinson CD, Lyons B, Katsiadaki I, Whalley C, den Haan K, Messiaen M, Clayton H, Lettieri T, Negrão Carvalho R, Gawlik BM, Dulio V, Hollert H, Di Paolo C, Brack W (2014). Technical Report on Aquatic Effect-Based Monitoring Tools. European Commission. Technical Report 2014-077, Office for Official Publications of European Communities, ISBN: 978-92-79-35787-9. doi:10.2779/7260

EU report on effect-based tools for ecotoxicology

Because of my previous involvement in a Swedish report on toxicological monitoring using (meta)genomics tools (1), I also became involved in a related EU report on effect-based tools for use in toxicology in the aquatic environment. This report has recently been officially published (2), and can be found here, with the annex available on the European Commission document website. My contribution to this report was in the genomics and metagenomics section (Chapter 7: OMICS techniques), in which I wrote the metagenomics part and contributed to the rest. I personally think this is a quite forward-thinking report, which is nice to see from a large institution such as the EU.

  1. Länsstyrelsen i Västra Götalands län. (2012). Swedish monitoring of hazardous substances in the aquatic environment (No. 2012:23). (A.-S. Wernersson, Ed.) Current vs required monitoring and potential developments (pp. 1–291). Länsstyrelsen i Västra Götalands län, vattenvårdsenheten.
  2. Wernersson A-S, Carere M, Maggi C, Tusil P, Soldan P, James A, Sanchez W, Broeg K, Kammann U, Reifferscheid G, Buchinger S, Maas H, Van Der Grinten E, Ausili A, Manfra L, Marziali L, Polesello S, Lacchetti I, Mancini L, Lilja K, Linderoth M, Lundeberg T, Fjällborg B, Porsbring T, Larsson DGJ, Bengtsson-Palme J, Förlin L, Kase R, Kienle C, Kunz P, Vermeirssen E, Werner I, Robinson CD, Lyons B, Katsiadaki I, Whalley C, den Haan K, Messiaen M, Clayton H, Lettieri T, Negrão Carvalho R, Gawlik BM, Dulio V, Hollert H, Di Paolo C, Brack W (2014). Technical Report on Aquatic Effect-Based Monitoring Tools. European Commission. Technical Report 2014-077, Office for Official Publications of European Communities, ISBN: 978-92-79-35787-9. doi:10.2779/7260