February 2021 Galactic News
Events, platform news, blog posts, videos, pubs, jobs and releases
- **[Event news](#event-news)**:
- GCC2021 will be virtual, affordable, and globally accessible.
- GTN Smörgåsbord: A Global Galaxy Course. *Register by February 12 and we want your help!*
- 2nd Galaxy-ELIXIR webinar series, *February 10, 17, 24.*
- Papercuts and GTN CollaborationFest: *February 25.*
- CWL Mini Conference. *Starts Today.*
- Galaxy Developer Roundtable. *February 18, we need your topics.*
- Single-cell RNA-seq & network analysis using Galaxy and Cytoscape. *Apply by 26 February.*
- **[Galaxy platform news](#galaxy-platforms-news)**:
- ATACgraph
- CorGAT on Laniakea
- Everything on UseGalaxy.eu
- Plus more UseGalaxy.\* news
- **[Blog posts](#galactic-blog-activity)**:
- Nora, the new visualisation in Galaxy
- Three articles related to COVID-19 research using Galaxy in the de.NBI brochure
- Trusted CI and SGCI Collaborate to Secure the Galaxy Platform
- New and updated CNV and Variant Calling tools
- Variant Analysis of SARS-CoV-2 Sequencing Data: January 2021 update
- Major update to SearchGUI and PeptideShaker
- Try out Galaxy in Terra via AnVIL
- Analysis of RNA-seq data from neurodegenerative disease
- Galaxy Metabolomics Mini-Symposium report
- **[Training material and doc updates](#doc-hub-and-training-updates)**:
- Report on 2021 Galaxy Admin Training
- Galaxy Server Administration Tutorials Update
- An introduction to scRNA-seq data analysis
- Gallantries implements automated text to speech for slides...
- ... resulting in lots of new videos!
- Life Science Trainers Questionnaire
- Functionally Assembled Terrestrial Ecosystem Simulator (FATES)
- Proteogenomics 2: Database Search
- Chloroplast genome assembly
- **[Publications](#publications)**:
- Tool recommender system in Galaxy using deep learning
- Comparative ligand structural analytics illustrated on variably glycosylated MUC1 antigen–antibody binding
- PDAUG - a Galaxy based toolset for peptide library analysis, visualization, and machine learning modeling
- CorGAT: A tool for the functional annotation of SARS-CoV-2 genomes
- Deep Learning for Detection and Segmentation in High-Content Microscopy Images
- ATACgraph: Profiling Genome-Wide Chromatin Accessibility From ATAC-seq
- SHARP: Harmonizing Galaxy and Taverna workflow provenance
- SARS-CoV-2 RECoVERY: A multi-platform open-source bioinformatic pipeline for the automatic construction and analysis of SARS-CoV-2 genomes from NGS sequencing data
- **[Who's hiring?](#whos-hiring)**
- Europe:
- INRAE & AgroParisTech, Norwegian University of Life Science, IFB, VIB
- North America:
- Cleveland Clinic, Roche, Johns Hopkins (AnVIL, 2 positions; Galaxy, 2 Positions)
- Europe:
- **[New releases](#releases)**:
- Galaxy Language Server 0.3.2
- **[Other News](#other-news)**:
- Chan Zuckerberg Initiative injects funds into Galaxy platform for biomedical research
- Nuovo software per l’analisi genomica del Covid
- EMBL COVID-19 Data Platform
If you have anything to include to next month's newsletter, then please send it to outreach@galaxyproject.org.
Event News
Despite COVID-19, there is still a lot going on, although online. We have updated our list of events to reflect what we know. Some highlights:
The 2021 Galaxy Community Conference will be virtual, affordable, and globally accessible.
Things are just too uncertain to continue to plan an in person event for early July. GCC2021 won't be in person, but it will be more accessible and affordable because of it:
- GCC2021 events will be held twice each day, once in their original Ghent time zone (GCC EMEA/APO), and again 8 hours later in the Americas (GCC Americas).
- Registration rates will be a fraction of the cost of an in person event, and will include a significant discount for students and researchers based in low and lower-middle income countries.
Other things are changing too. See the announcement for details.
15-19 February, Online, Global
This week-long workshop on how to use Galaxy will be online, global, and free. The program covers a general introduction to the Galaxy platform, NGS Analysis (DNA-seq and RNA-seq), Proteomics, and also features a Choose your own adventure day (!?).
Instructors and Instructor Support Wanted
Are you interested in teaching with Galaxy, and would you like to be involved in this event? Then we want your help.
10, 17, 24 February
Open Data Infrastructures to tackle COVID-19 pandemic
This series of webinars features experts from ELIXIR and the global Galaxy community demonstrating how open access and open science are fundamental for fast and efficient response to public health crises. This webinar series wraps up this month:
Insights from selection analysis of complete genomes and read-level data, 10 February
Viral Beacon and Galaxy variant workflows, 17 February
DRS, long-read-sequencing, proteomics and more — an update to recent COVID-19 workflow developments, 24 February
25 February, Online, Global
The February GTN CoFest & Community calls are joining forces with the February Papercuts Cofest day for a concurrent, 24 hour long event, starting in Melbourne, Australia, and following the sun all the way to Portland, Oregon.
Both events will feature community calls throughout the day with community engagement on chat throughout the day. This is an excellent opportunity to become a contributor to the global open science, open source, and open access Galaxy community.
Please join us online on 25 February, wherever you are in the world.
8-10 February, Online, Global
Three half days of talks, discussions and co-working time, all about the Common Workflow Language.
18 February, Online, Global
Working in mass spec? The Galaxy mass spec community gathers every 6 weeks and the next meetup is February 18. The meeting will include a panel discussion on Tools and Workflows for Mass Spec in Galaxy and GTN. Please join us.
There next roundtable meetup will be:
February 18: Featuring You!, We don't yet have topics for this round table. If you have topics you want to discuss, please submit them by February 15.
26-30 April, Online; Apply by 26 February
"This course utilises Galaxy pipelines, an online open-access resource that allows even the most computer-phobic bench scientists to analyse their biological data. Participants will be guided through the droplet-based scRNA-seq analysis pipelines from raw reads to cell cluster comparisons."
Galaxy Platforms News
The Galaxy Platform Directory lists resources for easily running your analysis on Galaxy, including publicly available servers, cloud services, and containers and VMs that run Galaxy. Here's the recent platform news we know about:
The ATACgraph Galaxy container profiles accessible chromatin regions and provides ATAC-seq-specific information including definitions of nucleosome-free regions (NFRs) and nucleosome-occupied regions. ATACgraph also allows identification of differentially accessible regions between two ATAC-seq datasets.
CorGAT aligns complete assemblies of SARS-CoV-2 genomes wih the reference genomic sequence, to obtain a list of polymorphic positions and to annotate genetic variants. See the CorGAT Manual for more. CorGAT is one of several servers that runs on Laniakea.
The Galaxy Europe team has been busy creating more customized Galaxy instances:
These are just a few of the large number of specialty subdomains hosted by UseGalaxy.eu.
- Lots of tool updates on UseGalaxy.eu and UseGalaxy.org.au.
- UseGalaxy.eu has increased in 11 TB its RAM thanks to de.NBI-Cloud.
- The European Galaxy server listed as an official resource on the COVID-19 Data Portal.
Galactic Blog Activity
A new framework for medical imaging research, Nora, has been added to the Galaxy visualisations. Nora has been developed by Dr. Marco Reisert and Dr. Elias Kellner at the Department of Radiology from the University Medical Center of Freiburg.
Nora will be included in the next Galaxy release, 21.01.
Three articles using Galaxy have been included in the de.NBI brochure Data analysis for the COVID-19 Research.
- RNA Bioinformatics to Analyze SARS-CoV-2 – The Causative Agent of COVID-19, by Wolfgang R. Hess, Steffen C. Lott, Steve Hoffmann and Rolf Backofen.
- Open Data, Software and Analytics as a response to emerging pathogen threats, by Beatriz Serrano-Solano, Wolfgang Maier, Simon Bray, Gianmauro Cuccuru, Anika Erxleben, Bérénice Batut, Mehmet Tekman, Rolf Backofen and Björn Grüning.
- Virtual Screening for SARS-CoV-2 Drug Development using Open Research and Compute Infrastructures, by Simon Bray, Beatriz Serrano-Solano and Björn Grüning.
By Kelli Shute.
Trusted CI and Galaxy reviewed the security of a new Galaxy software distribution being developed as a containerized package, with an eye toward its use with sensitive information such as protected health information (PHI). See the announcement and the report for details.
By Wolfgang Maier.
Wolfgang highlights a wide variety (14!) of tools useful for the analysis of Copy Number Variation (CNV).
By Wolfgang Maier and Björn Grüning.
Continuous tracking of viral evolution through genome sequencing.
By Carlos Horro Marcos.
After 3 years of work, SearchGUI (SG), a tool that performs protein identification using various search engines, and PeptideShaker (PS) for protein identification (which uses SearchGUI results) have been deeply updated and released in new major versions: 4.0.7 and 2.0.5, respectively.
By Geraldine Van der Auwera.
Run your own personal, and customizable Galaxy server in the secure (FISMA moderate) AnVIL environment.
By Lachlan Gray.
Lachlan tells the training experience or the SciX program at the University of New South Wales, Sydney.
By Melanie C. Föll.
The past 29th of January, the Galaxy Metabolomics community had the first Mini-Symposium in which users as well as developers had the chance to present their work and discuss the fuure integration into Galaxy.
To know more about the Galaxy Metabolomics community, please subscribe to the mailing list or join the metabolomics Gitter channel.
Doc, Hub, and Training Updates
By Helena Rasche.
Galaxy Admin Training was held online for the first time and running a global course for 88 participants required us to develop a large number of innovations we’re very excited to share with everyone. Here is what we learned and future directions.
By Galaxy Admin Training Instructors.
As a result of the recently completed Admin Training course, most Galaxy Server Admin topics were updated:
- Ansible
- Galaxy Installation with Ansible
- Connecting Galaxy to a compute cluster
- Data Libraries
- Distributed Object Storage
- Galaxy Monitoring with Reports
- Galaxy Monitoring with Telegraf and Grafana
- Galaxy Tool Management with Ephemeris
- Mapping Jobs to Destinations
- plus at least 6 more hands on examples and at least 13 slide decks.
By Mehmet Tekman.
This new slide deck (and accompanying video, see next) provide a broad introduction to single-cell RNA-seq analysis concepts.
Mehmet updated the Pre-processing of 10X Single-Cell RNA Datasets tutorial as well.
By Helena Rasche.
This new feature will help the GTN contributors, as they will only need to enable video and opt-in to automatic video production. The video will be re-built anytime a contributor updates their slide decks in a completely automatic way.
A number of tutorials have already created videos using the new text to speech capability:
- Functionally Assembled Terrestrial Ecosystem Simulator (FATES), added by Anne Fouilloux
- Genome Annotation with Prokka, added by Anthony Bretaudeau
- An introduction to scRNA-seq data analysis, added by Mehmet Tekman
- Thirteen videos on Galaxy Server Administration, created by the 2021 Galaxy Admin Training instructors.
- Scripting Galaxy using the API and BioBlend, added by Nicola Soranzo
The global Life Science Trainers Community (yes, you should join) wants to highlight the important role of trainers and understand how they can be better supported. If you are, or have been a trainer, share your thoughts in this survey by March 20.
By Anne Fouilloux and Hui Tang.
Familiarize yourself (especially you ecologists) with how to run a terrestrial ecosystem model (i.e., CLM-FATES) at site-level in Galaxy and then analyze the model results.
This tutorial on proteogenomic database searching using mass spectrometry data got a major update from Subina Mehta and JJ Johnson.
This genome assembly tutorial got an update from Anna Syme.
Publications
Pub curation activities are on a semi-hiatus right now but a few publications referencing, using, extending, and implementing Galaxy were added to the Galaxy Publication Library anyway. Here are the new open access Galactic and Stellar pubs:
Kumar, A., Rasche, H., Grüning, B., & Backofen, R. (2021). GigaScience, 10(giaa152). DOI: 10.1093/gigascience/giaa152
Barnett, C. B., Senapathi, T., & Naidoo, K. J. (2020). Beilstein Journal of Organic Chemistry, 16(1), 2540–2550. DOI: 10.3762/bjoc.16.206
Joshi, J., & Blankenberg, D. (2021). BioRxiv, 2021.02.02.429203. DOI: 10.1101/2021.02.02.429203
Chiara, M., Zambelli, F., Tangaro, M. A., Mandreoli, P., Horner, D. S., & Pesole, G. (2020). Bioinformatics, btaa1047. DOI: 10.1093/bioinformatics/btaa1047
Wollmann, T. S. (2020). [Dissertation, Heidelberg University]. DOI: 10.11588/heidok.00028827
Lu, R. J.-H., Liu, Y.-T., Huang, C. W., Yen, M.-R., Lin, C.-Y., & Chen, P.-Y. (2021). Frontiers in Genetics, 11. DOI: 10.3389/fgene.2020.618478
Gaignard, A., Belhajjame, K., & Skaf-Molli, H. (2017, May). SeWeBMeDA 2017 : Semantic Web Solutions for Large-Scale BioMedical Data Analtics.
Sabato, L. D., Vaccari, G., Knijn, A., Ianiro, G., Bartolo, I. D., & Morabito, S. (2021). BioRxiv, 2021.01.16.425365. DOI: 10.1101/2021.01.16.42536
Who's Hiring
Blankenberg Lab, Genomic Medicine Institute, Cleveland Clinic Lerner Research Institute, Cleveland, Ohio, United States
Utilize high-throughput omics technologies, such as next generation sequencing, and data-intensive computing to explore biomedical research questions.
Galaxy-SynBioCAD team, MICALIS Institute, INRAE & AgroParisTech, Jouy-en-Josas, France
Design solutions to synthesize molecules in microorganisms & to implement the sesolutions on robotized workstations.
Apply by 1 March.
MEMO Group, Norwegian University of Life Science, Ås, Norway
Interested in host-microbiome interactions and multi-omic data? We have multiple positions starting in 2021. Projects have fun and interesting EU partners. Will be hiring after Christmas.
Department of Biostatistics, Bloomberg School of Public Health, Johns Hopkins University.
Data science research and education focusing on genomics (AnVIL, Genomic Data Science Community Network), cancer (ITCR) or pain A2CPS.
Roche, Bay Area, California, United States.
- Lead data mining for biomarker discovery for medical conditions of interest.
- Develop Agile Assay Design (AAD) tools for qPCR tests.
- NGS data analysis tools and/or workflows.
- Use these tools & workflows for R\&D projects.
- Deploy these tools on Roche intranet (Galaxy) and train scientists to use them.
Johns Hopkins University, Baltimore, Maryland, United States.
Provide technical expertise and oversight for the AnVIL Project, which incorporates Galaxy, Bioconductor, Terra, Gen3, and Dockstore into a secure cloud-based software ecosystem for genomic data analysis.
The Schatz Lab at Johns Hopkins University is looking for:
- Self-driven individuals that can work independently to fill multiple software development positions on the Galaxy Project.
- Ambitious individuals to fill a programmer analyst position working on the Galaxy and AnVIL projects.
The French Institute of Bioinformatics (IFB) is offering a 1-year position for a developer to work on usegalaxy.fr, focused on the contribution to the development, evolution, deployment and maintenance of the French infrastructure.
VIB-UGent Center for Plant Systems Biology has two open positions to work on the ELIXIR Belgium research data analysis team, both for an initial duration of 2 years.
Releases
Galaxy Language Server and Galaxy Tools VS Extension assist in the development fo Galaxy tools wrappers inside modern code editors.
The release 0.3.2. includes fixes and new features. See the GitHub repository for details.
Other News
This news item from the Australian Research Data Commons (ADRC) highlights the recent grant to extend Galaxy, and Galaxy Australia's role in the effort.
Also see Nuwan Goonasekera shares in Chan Zuckerberg Initiative grant for the Galaxy Project from Melbourne Bioinformatics.
Cnr e Statale di Milano hanno realizzato un nuovo software per l’analisi genomica del SARS-CoV-2: lo studio è stato pubblicato sulla rivista Bioinformatics.
As the EMBL COVID19 Data Platform expands, it has enabled our partners at ELIXIR Belgium, Galaxy Project and Open Targets to build useful infectious disease tools and services on top of it.