Galaxy Community Hub
Galaxy at CiteULike Social Bookmarking Service

The Galaxy CiteULike library recently reached a milestone: it now holds over 2500 publications. The Galaxy CiteULike Group was launched in December 2011 and reached 1000 papers 18 months later.

To be included in the library, a publication needs to reference or mention Galaxy, extend Galaxy, use or reference a Galaxy instance, or otherwise discuss Galaxy or cite one of the Galaxy Project papers.

Here’s a review of those first 2500 papers.

The Tags

Each paper is reviewed and one or more tags are added to it. The initial set featured 9 tags. This wasn’t quite enough, and 8 more were added in 2013, bringing the total to 17. (Papers from 2012 were back-curated with the new tags, but earlier papers were not.) The counts for each tag in each year for the first 2500 papers are below.

Tag 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 Total
methods 2 15 26 50 92 196 260 325 258 1224
workbench 1 3 7 11 18 36 68 127 151 205 121 748
usemain 1 1 3 89 95 64 253
tools 3 1 8 28 36 67 37 180
usepublic 1 15 57 77 150
isgalaxy 2 2 5 16 15 26 45 24 135
cloud 1 2 13 22 38 24 100
uselocal 2 28 41 27 98
shared 1 1 7 13 21 24 17 84
other 1 1 8 8 38 21 77
refpublic 10 28 26 64
reproducibility 6 7 8 24 12 57
unknown 2 5 7 3 10 13 8 5 53
project 1 2 1 5 6 10 6 7 9 47
howto 1 2 1 3 4 12 6 11 5 45
visualization 1 1 2 3 6 3 16
usecloud 2 1 1 4

A couple of trends stand out in the tag data:

  • methods and workbench have always been the most popular. methods papers use Galaxy in their analysis. workbench papers either just mention Galaxy or discuss the platform itself.
  • The numbers of usepublic and refpublic publications are climbing rapidly. Respectively, these are methods studies that did their analyses on a public Galaxy server other than usegalaxy.org, and papers that reference those servers in some way (besides their methodology). Part of this increase reflects better tracking of these papers, but (I believe) most of the increase reflects both the increased number of public servers (as reflected by the isgalaxy numbers), and their increased visibility.
  • reproducibility became a hot topic (finally) in 2014.
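
To put the tag totals in proportion, here is a minimal Python sketch that computes each tag's share of the 2500 papers, using the totals from the last column of the tag table. The names `tag_totals` and `share_of_library` are illustrative only and not part of any Galaxy or CiteULike tool. Because a paper can carry more than one tag, the shares are not expected to add up to 100%.

```python
# Totals per tag, copied from the last column of the tag table.
LIBRARY_SIZE = 2500

tag_totals = {
    "methods": 1224,
    "workbench": 748,
    "usemain": 253,
    "tools": 180,
    "usepublic": 150,
    "isgalaxy": 135,
    "cloud": 100,
    "uselocal": 98,
    "shared": 84,
    "other": 77,
    "refpublic": 64,
    "reproducibility": 57,
    "unknown": 53,
    "project": 47,
    "howto": 45,
    "visualization": 16,
    "usecloud": 4,
}


def share_of_library(count, library_size=LIBRARY_SIZE):
    """Return the percentage of the library that carries a given tag."""
    return 100.0 * count / library_size


if __name__ == "__main__":
    # Print tags from most to least used, with their share of all papers.
    ranked = sorted(tag_totals.items(), key=lambda item: item[1], reverse=True)
    for tag, count in ranked:
        print(f"{tag:<16}{count:>5}{share_of_library(count):>7.1f}%")
```

By this count, the 17 tag totals add up to 3335 tag assignments across the 2500 papers, or about 1.3 tags per paper.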

The Journals

Information on where the papers appeared is also available. Galaxy-related papers have appeared in over 500 different journals. Journals with 15 or more Galaxy papers are:

Journal 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 Total
PLOS ONE 0 0 0 1 1 4 16 32 42 45 27 168
Nucleic Acids Research 01011113821222919 125
BMC Genomics 0 0 0 0 2 4 12 22 27 42 15 124
Bioinformatics 0 0 0 5 4 7 23 19 19 30 10 117
BMC Bioinformatics 0 0 1 0 2 10 15 16 8 25 6 83
Genome Research 1 2 4 3 5 2 9 12 2 8 3 51
Genome Biology 0001391111462 47
PLOS Genet 0002016101564 44
Briefings in Bioinformatics 0 0 0 1 1 1 3 2 8 4 9 29
Genome Announcements 0 0 0 0 0 0 0 0 4 3 19 26
Concurrency and Computation: Practice and Experience 0000001301110 25
Proceedings of the National Academy of Sciences 0 0 0 1 1 0 2 3 4 8 4 23
Cell 0 0 0 0 0 1 1 5 8 3 5 23
PLOS Comput Biol 0 0 2 1 0 2 3 3 5 5 2 23
Molecular Biology and Evolution 0 0 0 0 3 0 1 1 9 2 4 20
Nature Communications 0 0 0 0 0 0 0 2 4 7 7 20
Molecular Ecology 000001111140 18
Database 0 0 0 0 0 1 6 2 4 3 1 17
Genome Biology and Evolution 0 0 0 0 0 1 3 2 3 5 2 16
Nature 1 0 0 0 1 3 1 0 4 2 3 15
Cell Reports 0 0 0 0 0 0 0 3 5 4 3 15

There are also many unexpected journals in the list:
  • Applied Stochastic Models in Business and Industry
  • Computers & Geosciences
  • Current Opinion in Solid State & Materials Science
  • Journal of Archaeological Science

The Summary

Finally, the total number of papers per year continues to increase, and we expect 2015 to surpass 800 papers.

Year 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 Total
Papers 2 4 12 31 53 107 202 394 495 703 497 2500
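
The yearly figures can be checked and summarized with a few lines of Python. This is a rough sketch, not an official analysis: the function names are made up for illustration, and the 2015 count covers only part of the year, so it is left out of the year-over-year ratios.

```python
from itertools import accumulate

# Papers per publication year, copied from the summary table.
# The 2015 figure covers only part of the year.
papers_per_year = {
    2005: 2, 2006: 4, 2007: 12, 2008: 31, 2009: 53, 2010: 107,
    2011: 202, 2012: 394, 2013: 495, 2014: 703, 2015: 497,
}


def cumulative_counts(per_year):
    """Map each year to the running total of papers through that year."""
    years = sorted(per_year)
    running = accumulate(per_year[year] for year in years)
    return dict(zip(years, running))


def growth_ratios(per_year, last_full_year=2014):
    """Ratio of each complete year's count to the previous year's count."""
    years = [year for year in sorted(per_year) if year <= last_full_year]
    return {year: per_year[year] / per_year[year - 1] for year in years[1:]}


if __name__ == "__main__":
    totals = cumulative_counts(papers_per_year)
    for year, ratio in growth_ratios(papers_per_year).items():
        print(f"{year}: {papers_per_year[year]:>4} papers, "
              f"x{ratio:.2f} vs. previous year, {totals[year]:>5} cumulative")
```

The yearly counts do sum to 2500, and the 497 papers recorded so far for 2015 leave a little over 300 to go before the 800 mark.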

We will continue to report new papers in the monthly Galaxy newsletters. New tags may also show up as the project and community evolve.

In the meantime, I expect the next 2500 papers will be published in considerably less time than the first 2500. I’m looking forward to all that research.

Dave Clements