Tool or resource | Description | Related pages | Registry |
---|---|---|---|
1+ Million Genomes (1+MG) | The 1+ Million Genomes (1+MG) initiative aims to enable secure access to genomics and the corresponding clinical data across Europe for better research, personalised healthcare and health policy making. Since the Digital Day 2018, 25 EU countries, the UK and Norway signed Member States declaration on stepping up efforts towards creating a European data infrastructure for genomic data and implementing common national rules enabling federated data access. The initiative forms part of the EU's agenda for the Digital Transformation of Health and Care and is aligned with the goals of the European Health Data Space. | Training | |
ACE Cohort | Asymptomatic COVID-19 in Education (ACE) Cohort | Data sources | Training |
ADA-M | Responsible sharing of biomedical data and biospecimens via the Automatable Discovery and Access Matrix (ADA-M). The Automatable Discovery and Access Matrix (ADA-M) provides a standardized way to unambiguously represent the conditions related to data discovery and access. By adopting ADA-M, data custodians can generally describe what their data are (the Header section), who can access them (the Permissions section), terms related to their use (the Terms section), and special conditions (the Meta-Conditions). By doing so, data custodians can participate in data sharing and collaboration by making meta information about their data computer-readable and hence directly available for digital communication, searching and automation activities. | Data sources | Tool info |
ANNOVAR | ANNOVAR is an efficient software tool to utilize update-to-date information to functionally annotate genetic variants detected from diverse genomes. | Data analysis | Tool info |
ArrayExpress | ArrayExpress is a database of functional genomics experiments that can be queried and the data downloaded. It includes gene expression data from microarray and high throughput sequencing studies. Data is collected to MIAME and MINSEQE standards. Experiments are submitted directly to ArrayExpress or are imported from the NCBI GEO database. | Data sources | Tool info Standards/Databases Training |
Arvados | With Arvados, bioinformaticians run and scale compute-intensive workflows, developers create biomedical applications, and IT administrators manage large compute and storage resources. | Data analysis | |
Beacon | Beacon v2 is a protocol/specification established by the Global Alliance for Genomics and Health initiative (GA4GH) that defines an open standard for federated discovery of genomic data and associated information in biomedical research and clinical applications. The Beacon v2 standard - a major update from the original GA4GH Beacon first proposed in 2014 - was accepted by GA4GH in April 2022. Beacon v2 consists of two components, the Framework and the Models. The Framework contains the format for the requests and responses, whereas the Models define the structure of the biological data response. | Data sources | Tool info Standards/Databases Training |
Beyond 1 Million Genomes (B1MG) | The Beyond 1 Million Genomes (B1MG) project is helping to create a network of genetic and clinical data across Europe. The project provides coordination and support to the 1+ Million Genomes Initiative (1+MG). This initiative is a commitment of 24 EU countries, the UK and Norway to give cross-border access to one million sequenced genomes by 2022. | ||
BioGRID | BioGRID is a comprehensive biomedical repository for curated protein, genetic and chemical interactions | Data analysis | Tool info Standards/Databases |
BioSamples | BioSamples stores and supplies descriptions and metadata about biological samples used in research and development by academia and industry. Samples are either 'reference' samples (e.g. from 1000 Genomes, HipSci, FAANG) or have been used in an assay database such as the European Nucleotide Archive (ENA) or ArrayExpress. It provides links to assays and specific samples, and accepts direct submissions of sample information. | Data sources | Tool info Standards/Databases Training |
Bismark | Bismark is a program to map bisulfite treated sequencing reads to a genome of interest and perform methylation calls in a single step. | Data analysis | Tool info Training |
Bitbucket | Git based code hosting and collaboration tool, built for teams. | Data analysis | Standards/Databases |
Bowtie2 | Bowtie 2 is an ultrafast and memory-efficient tool for aligning sequencing reads to long reference sequences. | Data analysis | Tool info Training |
BWA | BWA is a software package for mapping DNA sequences against a large reference genome, such as the human genome. | Data analysis | Tool info Training |
Canu | Canu is a fork of the Celera Assembler, designed for high-noise single-molecule sequencing. | Data analysis | Tool info |
CESSDA Data Catalogue (CDC) | CDC is a one-stop shop for searching and finding European social science data. | Data sources | Tool info Standards/Databases |
CESSDA Vocabulary Service | CESSDA Vocabulary Service enables users to discover, browse, and download controlled vocabularies in a variety of languages. The service is provided by the Consortium of European Social Science Data Archives (CESSDA). The majority of the source (English) vocabularies included in the service have been created by the DDI Alliance. The Data Documentation Initiative (DDI) is an international standard for describing data produced by surveys and other observational methods in the social, behavioural, economic, and health sciences. | Standards/Databases | |
ClustalW | ClustalW is a progressive multiple sequence alignment tool to align a set of sequences by repeatedly aligning pairs of sequences and previously generated alignments. | Data analysis | Tool info |
Common Workflow Language (CWL) | An open standard for describing workflows that are build from command line tools | Provenance | Standards/Databases Training |
COMPSs | COMP Superscalar (COMPSs) is a task-based programming model which aims to ease the development of applications for distributed infrastructures, such as large High-Performance clusters (HPC), clouds and container managed clusters. | Provenance | Tool info |
Covid-19 data portal | The COVID-19 Data Portal enables researchers to upload, access and analyse COVID-19 related reference data and specialist datasets. The aim of the COVID-19 Data Portal is to facilitate data sharing and analysis, and to accelerate coronavirus research. The portal includes relevant datasets submitted to EMBL-EBI as well as other major centres for biomedical data. The COVID-19 Data Portal is the primary entry point into the functions of a wider project, the European COVID-19 Data Platform. | Data sources Data sources | Tool info Standards/Databases Training |
CRG COVID-19 Viral Beacon | A platform allowing for browsing SARS-CoV-2 variability at the genome, amino acid, structural, and motif levels | An automated SARS-CoV-... | |
Cromwell | Cromwell is a Workflow Management System geared towards scientific workflows. | Data analysis | |
cwltool | Reference implementatio to provide comprehensive validation of CWL files as well as provide other tools related to working with CWL. | Data analysis | |
Cytoscape | Cytoscape provides a solid platform for network visualization and analysis | Data analysis | Tool info Training |
DAVID | The Database for Annotation, Visualization and Integrated Discovery (DAVID) provides a comprehensive set of functional annotation tools for investigators to understand the biological meaning behind large lists of genes. | Data analysis | Tool info Training |
dbGaP | The Database of Genotypes and Phenotypes (dbGaP) archives and distributes the results of studies that have investigated the interaction of genotype and phenotype. Such studies include genome-wide association studies, medical sequencing, molecular diagnostic assays, as well as association between genotype and non-clinical traits. | Data sources | Tool info Standards/Databases Training |
dbNSFP | A comprehensive database of transcript-specific functional predictions and annotations for human nonsynonymous and splice-site SNVs | Data analysis | Tool info |
DCAT | An RDF vocabulary designed to facilitate interoperability between data catalogs published on the Web. By using DCAT to describe datasets in data catalogs, publishers increase discoverability and enable applications easily to consume metadata from multiple catalogs. | Data sources | Standards/Databases |
DeepVariant | DeepVariant is a deep learning-based variant caller that takes aligned reads (in BAM or CRAM format), produces pileup image tensors from them, classifies each tensor using a convolutional neural network, and finally reports the results in a standard VCF or gVCF file. | Data analysis | Tool info |
Delly | Delly is an integrated structural variant (SV) prediction method that can discover, genotype and visualize deletions, tandem duplications, inversions and translocations at single-nucleotide resolution in short-read and long-read massively parallel sequencing data. | Data analysis | Tool info |
DESeq2 | Differential gene expression analysis based on the negative binomial distribution | Data analysis | Tool info Training |
DMP Online | DMP online is an online planning tool to help you write an effective DMP based on an institutional or funder template. | Training | |
Docker | Docker is a software for the execution of applications in virtualized environments called containers. It is linked to DockerHub, a library for sharing container images | Data analysis | Standards/Databases Standards/Databases Training |
Dragen-GATK | DRAGEN-GATK Best Practices contains open-source workflows that are compatible between Illumina's platforms and mainstream infrastructure. | Data analysis | |
Dryad | Dryad is an open-source, community-led data curation, publishing, and preservation platform for CC0 publicly available research data. Dryad has a long-term data preservation strategy, and is a Core Trust Seal Certified Merritt repository with storage in US and EU at the San Diego Supercomputing Center, DANS, and Zenodo. While data is undergoing peer review, it is embargoed if the related journal requires / allows this. Dryad is an independent non-profit that works directly with: researchers to publish datasets utilising best practices for discovery and reuse; publishers to support the integration of data availability statements and data citations into their workflows; and institutions to enable scalable campus support for research data management best practices at low cost. Costs are covered by institutional, publisher, and funder members, otherwise a one-time fee of $120 for authors to cover cost of curation and preservation. Dryad also receives direct funder support through grants. | Data sources | Standards/Databases |
Dutch COVID-19 Data Portal | The dutch COVID-19 Data Portal provides researchers with a clear overview of what is available, allow searching for specific data and make access to such data easier when the necessary ethical and legal conditions have been met. | Data sources | |
EBI | The European Bioinformatics Institute is a bioinformatics research center that is part of the European Molecular Biology Laboratory and is located in Hinxton, England. The institution combines intense research activity with the development and maintenance of a set of bioinformatics lines, services and databases. | Data sources | Training |
EdgeR | Empirical Analysis of Digital Gene Expression Data in R | Data analysis | Tool info Training |
ENA upload tool | Submits experimental data and respective metadata to the European Nucleotide Archive (ENA). | SARS-CoV-2 sequencing ... | |
Enrichr | Functional Enrichment Analysis and Network Construction | Tool info | |
Estonian Biobank | The Estonian Biobank has established a population-based biobank of Estonia with a current cohort size of more than 200,000 individuals (genotyped with genome-wide arrays), reflecting the age, sex and geographical distribution of the adult Estonian population. Considering the fact that about 20% of Estonia's adult population has joined the programme, it is indeed a database that is very important for the development of medical science both domestically and internationally. | Data sources | |
Estonian COVID-19 Data Portal | Estonian instance of the COVID-19 Data Portal. Among other informaiton, served Estonian SARS-CoV-2 sequencing dashboards. | SARS-CoV-2 sequencing ... | |
EUI COVID-19 SSH Data Portal | The COVID-19 SSH Data Portal provides integrated search, discovery, and linking to datasets published on the web relevant for COVID-19-related research in the Social Sciences and Humanities. | Data sources | Standards/Databases |
European Genome-phenome Archive (EGA) | The European Genome-phenome Archive (EGA) is a service for permanent archiving and sharing of personally identifiable genetic, phenotypic, and clinical data generated for the purposes of biomedical research projects or in the context of research-focused healthcare systems. Access to data must be approved by the specified Data Access Committee (DAC). | Data sources | Tool info Standards/Databases Training |
European Language Social Science Thesaurus (ELSST) | The European Language Social Science Thesaurus (ELSST) is a broad-based, multilingual thesaurus for the social sciences. It is owned and published by the Consortium of European Social Science Data Archives (CESSDA) and its national Service Providers. The thesaurus consists of over 3,300 concepts and covers the core social science disciplines: politics, sociology, economics, education, law, crime, demography, health, employment, information, communication technology, and environmental science. ELSST is used for data discovery within CESSDA and facilitates access to data resources across Europe, independent of domain, resource, language, or vocabulary. ELSST is currently available in 16 languages: Danish, Dutch, Czech, English, Finnish, French, German, Greek, Hungarian, Icelandic, Lithuanian, Norwegian, Romanian, Slovenian, Spanish, and Swedish | Standards/Databases | |
European Nucleotide Archive (ENA) | Provides a record of the nucleotide sequencing information. It includes raw sequencing data, sequence assembly information and functional annotation. | Data sources An automated SARS-CoV-... SARS-CoV-2 sequencing ... | Tool info Standards/Databases Training |
fairsharing | fairsharing is a FAIR-supporting resource that provides an informative and educational registry on data standards, databases, repositories and policy, alongside search and visualization tools and services that interoperate with other FAIR-enabling resources. fairsharing guides consumers to discover, select and use standards, databases, repositories and policy with confidence, and producers to make their resources more discoverable, more widely adopted and cited. Each record in fairsharing is curated in collaboration with the maintainers of the resource themselves, ensuring that the metadata in the fairsharing registry is accurate and timely. Every record is manually reviewed at least once a year. Records can be collated into collections, based on a project, society or organisation, or Recommendations, where they are collated around a policy, such as a journal or funder data policy. | Data sources | Standards/Databases Training |
Federated EGA | The Federated EGA is an infrastructure built upon the European Genome-phenome Archive (EGA), an EMBL-EBI and CRG data resource for secure archiving and sharing of human sensitive biomolecular and phenotypic data resulting from biomedical research projects. | Data sources | Training |
Figshare | Figshare is a generalist, subject-agnostic repository for many different types of digital objects that can be used without cost to researchers. Data can be submitted to the central figshare repository (described here), or institutional repositories using the figshare software can be installed locally, e.g. by universities and publishers. Metadata in figshare is licenced under is CC0. figshare has also partnered with DuraSpace and Chronopolis to offer further assurances that public data will be archived under the stewardship of Chronopolis. figshare is supported through Institutional, Funder, and Governmental service subscriptions. | Data sources | Standards/Databases Training |
Flye | Flye is a de novo assembler for single-molecule sequencing reads, such as those produced by PacBio and Oxford Nanopore Technologies. | Data analysis | Tool info Training |
FreeBayes | FreeBayes is a Bayesian genetic variant detector designed to find small polymorphisms, specifically SNPs, indels, MNPs, and complex events smaller than the length of a short-read sequencing alignment. | Data analysis | Tool info Training |
GA4GH | The metadata model for GA4GH, an international coalition of both public and private interested parties, formed to enable the sharing of genomic and clinical data. | Tool info Standards/Databases Training | |
Galaxy | Open, web-based platform for data intensive biomedical research. Whether on the free public server or your own instance, you can perform, reproduce, and share complete analyses. | Data analysis Provenance | Tool info Training |
Galaxy Europe | The European Galaxy server. Provides access to thousands of tools for scalable and reproducible analysis. | An automated SARS-CoV-... | |
Galaxy University of Tartu | The University of Tartu Galaxy instance. Enables local university users to run their analyses in the Galaxy environment. Was heavily used during the KoroGenoEST sequencing studies. | SARS-CoV-2 sequencing ... | |
GenBank | GenBank is the NIH genetic sequence database of annotated collections of all publicly available DNA sequences. | Data sources | Tool info Standards/Databases Training |
GeneMANIA | GeneMANIA helps you predict the function of your favourite genes and gene sets. | Data analysis | Tool info Training |
Genomic Data Infrastructure (GDI) | The Genomic Data Infrastructure (GDI) project is enabling access to genomic and related phenotypic and clinical data across Europe. It is doing this by establishing a federated, sustainable and secure infrastructure to access the data. It builds on the outputs of the Beyond 1 Million Genomes (B1MG) project and is realising the ambition of the 1+Million Genomes (1+MG) initiative. | ||
GEO | The Gene Expression Omnibus (GEO) is a public repository that archives and freely distributes microarray, next-generation sequencing, and other forms of high-throughput functional genomic data submitted by the scientific community. Accepts next generation sequence data that examine quantitative gene expression, gene regulation, epigenomics or other aspects of functional genomics using methods such as RNA-seq, miRNA-seq, ChIP-seq, RIP-seq, HiC-seq, methyl-seq, etc. GEO will process all components of your study, including the samples, project description, processed data files, and will submit the raw data files to the Sequence Read Archive (SRA) on the researchers behalf. In addition to data storage, a collection of web-based interfaces and applications are available to help users query and download the studies and gene expression patterns stored in GEO. | Data sources | Standards/Databases Training |
ggplot2 | ggplot2 is a system for declaratively creating graphics, based on The Grammar of Graphics. | Data analysis | Tool info Training |
GitHub | GitHub is a versioning system, used for sharing code, as well as for sharing of small data. | Data analysis | Standards/Databases Standards/Databases Training |
GitLab | GitLab is an open source end-to-end software development platform with built-in version control, issue tracking, code review, CI/CD, and more. Self-host GitLab on your own servers, in a container, or on a cloud provider. | Data analysis | Standards/Databases Training |
GO | GO is to perform enrichment analysis on gene sets. | Data analysis | Tool info Training |
GRIDSS | GRIDSS is a module software suite containing tools useful for the detection of genomic rearrangements. | Data analysis | Tool info |
GSEA | Gene Set Enrichment Analysis (GSEA) is a computational method that determines whether an a priori defined set of genes shows statistically significant, concordant differences between two biological states | Data analysis | Tool info Training |
GTEx | The Genotype-Tissue Expression (GTEx) project is an ongoing effort to build a comprehensive public resource to study tissue-specific gene expression and regulation. Samples were collected from 53 non-diseased tissue sites across nearly 1000 individuals, primarily for molecular assays including WGS, WES, and RNA-Seq. Remaining samples are available from the GTEx Biobank. The GTEx Portal provides open access to data including gene expression, QTLs, and histology images. | Data sources | Tool info Standards/Databases Training |
HISAT2 | HISAT2 is a fast and sensitive alignment program for mapping next-generation sequencing reads (whole-genome, transcriptome, and exome sequencing data) to a population of human genomes (as well as to a single reference genome). | Data analysis | Tool info Training |
IGV | The Integrative Genomics Viewer (IGV) is a high-performance, easy-to-use, interactive tool for the visual exploration of genomic data. | Data analysis | Tool info Training |
IntAct | IntAct (Molecular Interaction Database) Website | Data analysis | Tool info Standards/Databases Training |
KEGG | A set of annotation maps for Kyoto encyclopedia of genes and genomes (KEGG) | Data analysis | Tool info Training |
LimeSurvey | LimeSurvey is a free and open source advanced online survey system to create online surveys. | ||
Lumpy | A probabilistic framework for structural variant discovery. | Data analysis | Tool info |
MACS | Model-based Analysis of ChIP-Seq (MACS), for identifying transcript factor binding sites. | Data analysis | Tool info Training |
MAFFT | MAFFT is a multiple sequence alignment program | Data analysis | Tool info |
Manta | Manta calls structural variants (SVs) and indels from mapped paired-end sequencing reads. | Data analysis | Tool info |
matplotlib | Matplotlib is a comprehensive library for creating static, animated, and interactive visualizations in Python. | Data analysis | Tool info |
MetaboAnalyst | MetaboAnalyst is a comprehensive platform dedicated for metabolomics data analysis via user-friendly, web-based interface. | Data analysis | Tool info Training |
Metagen-FastQC | Cleans metagenomic reads to remove adapters, low-quality bases and host (e.g. human) contamination. | SARS-CoV-2 sequencing ... | |
MethylKit | methylKit is an R package for DNA methylation analysis and annotation from high-throughput bisulfite sequencing. | Data analysis | Tool info |
methylPipe | Base resolution DNA methylation data analysis | Data analysis | Tool info |
MetSign | A computational platform for high-resolution mass spectrometry-based metabolomics | Data analysis | |
MIABIS | MIABIS represents the minimum information required to initiate collaborations between biobanks and to enable the exchange of biological samples and data. The aim is to facilitate the reuse of bio-resources and associated data by harmonizing biobanking and biomedical research. | Data sources | Standards/Databases |
MUSCLE | MUSCLE is widely-used software for making multiple alignments of biological sequences. | Data analysis | Tool info |
Mzmine | MZmine 3 is an open-source software for mass-spectrometry data processing, with the main focus on LC-MS data. | Data analysis | Tool info |
NCBI | The National Center for Biotechnology Information advances science and health by providing access to biomedical and genomic information. | Training | |
Nextflow | Nextflow is a framework for data analysis workflow execution | Data analysis | Tool info Training |
Nextstrain Auspiece | Estonian local instance of the Nextstrain Auspiece application that serves SARS-CoV-2 phylogenetic data | SARS-CoV-2 sequencing ... | |
ODDISEI Secure ANalysis Environment (SANE) | SANE is a virtual container in which the researcher can analyse sensitive data, while the data owner retains full control. | Data sources | |
Omicsgenerator | Omics Integrator is a package designed to integrate proteomic data, gene expression data and/or epigenetic data using a protein-protein interaction network. | Data analysis | |
OpenMS | OpenMS is an open-source software C++ library for LC-MS data management and analyses. | Data analysis | Tool info Training |
Panther | The PANTHER (Protein ANalysis THrough Evolutionary Relationships) Classification System is a unique resource that classifies genes by their functions, using published scientific experimental evidence and evolutionary relationships to predict function even in the absence of direct experimental evidence. | Data sources | Tool info Standards/Databases Training |
PhyML | PhyML is a software package that uses modern statistical approaches to analyse alignments of nucleotide or amino acid sequences in a phylogenetic framework. | Data analysis | Tool info |
R Markdown | R Markdown can help to turn your analyses into high quality documents, reports, presentations and dashboards. | Training | |
R Shiny | Shiny is an R package that makes it easy to build interactive web apps straight from R. | Training | |
Research Object Crate (RO-Crate) | RO-Crate is a lightweight approach to packaging research data with their metadata, using schema.org. An RO-Crate is a structured archive of all the items that contributed to the research outcome, including their identifiers, provenance, relations and annotations. | Provenance | Standards/Databases |
Sapporo WES | Implementation of Workflow Execution Service (WES) or so-called Workflow-as-a-Service. | Provenance | Tool info |
SARS-CoV-2 Data Hubs | Using technology that builds upon existing EMBL-EBI infrastructure, we provide SARS-CoV-2 Data Hubs to those public health agencies and other scientific groups responsible for generating viral sequence data from the outbreak at national or regional levels. | Data sources | |
Schema.org | Schema.org is a collaborative, community activity with a mission to create, maintain, and promote schemas for structured data on the Internet, on web pages, in email messages, and beyond. | Provenance | Standards/Databases Training |
SICER2 | Redesigned and improved ChIP-seq broad peak calling tool SICER | Data analysis | |
Singularity | Singularity is a widely-adopted container runtime that implements a unique security model to mitigate privilege escalation risks and provides a platform to capture a complete application environment into a single file (SIF) | Data analysis | Training |
Snakemake | Snakemake is a framework for data analysis workflow execution | Data analysis Provenance | Tool info Training |
SnpEff | Genetic variant annotation and functional effect prediction toolbox. It annotates and predicts the effects of genetic variants on genes and proteins. | Data analysis | Tool info Training |
SPAdes | SPAdes is an assembly toolkit containing various assembly pipelines. | Data analysis | Tool info Training |
SRA | The SRA is NIH's primary archive of high-throughput sequencing data and is part of the International Nucleotide Sequence Database Collaboration (INSDC) that includes at the NCBI Sequence Read Archive (SRA), the European Bioinformatics Institute (EBI), and the DNA Database of Japan (DDBJ). Data submitted to any of the three organizations are shared among them. SRA accepts data from all kinds of sequencing projects including clinically important studies that involve human subjects or their metagenomes, which may contain human sequences. These data often have a controlled access via dbGaP (the database of Genotypes and Phenotypes). | Data sources | Tool info Standards/Databases Training |
STAR | Spliced Transcripts Alignment to a Reference | Data analysis | Tool info Training |
StreamFlow | Container-native workflow manager for hybrid infrastructures | Provenance | |
TCGA | The Cancer Genome Atlas (TCGA) is a comprehensive, collaborative effort led by the National Institutes of Health (NIH) to map the genomic changes associated with specific types of tumors to improve the prevention, diagnosis and treatment of cancer. Its mission is to accelerate the understanding of the molecular basis of cancer through the application of genome analysis and characterization technologies. | Data sources | Standards/Databases Training |
The Data Use Ontology (DUO) | The Data Use Ontology (DUO) describes data use requirements and limitations. DUO allows to semantically tag datasets with restriction about their usage, making them discoverable automatically based on the authorization level of users, or intended usage. This resource is based on the OBO Foundry principles, and developed using the W3C Web Ontology Language. It is used in production by the European Genome-phenome Archive (EGA) at EMBL-EBI and CRG as well as the Broad Institute for the Data Use Oversight System (DUOS). | Data sources | Standards/Databases |
toil-cwl-runner | The toil-cwl-runner command provides cwl-parsing functionality using cwltool, and leverages the job-scheduling and batch system support of Toil. | Data analysis | |
UCSC Genome Browser | An online tool for analyzing and visualizing genomic data. It allows users to add and share annotations. | Data analysis An automated SARS-CoV-... | Tool info Standards/Databases |
VarScan | Variant calling and somatic mutation/CNV detection for next-generation sequencing data | Data analysis | Tool info |
VEP | VEP (Variant Effect Predictor) predicts the functional effects of genomic variants. | Data analysis | Tool info Training |
WfExS | Workflow Execution Service Backend (WfExS-backend) is a high-level orchestrator to run scientific workflows reproducibly. | Provenance | |
WorkflowHub | A registry for describing, sharing and publishing scientific computational workflows. | An automated SARS-CoV-... | Tool info Standards/Databases Training |
wtdbg2 | Wtdbg2 is a de novo sequence assembler for long noisy reads produced by PacBio or Oxford Nanopore Technologies (ONT). | Data analysis | Tool info |
XCMS | Metabolomic and lipidomic platform | Data analysis | Tool info Training |
Zenodo | Zenodo is a generalist research data repository built and developed by OpenAIRE and CERN. It was developed to aid Open Science and is built on open source code. Zenodo helps researchers receive credit by making the research results citable and through OpenAIRE integrates them into existing reporting lines to funding agencies like the European Commission. Citation information is also passed to DataCite and onto the scholarly aggregators. Content is available publicly under any one of 400 open licences (from opendefinition.org and spdx.org). Restricted and Closed content is also supported. Free for researchers below 50 GB/dataset. Content is both online on disk and offline on tape as part of a long-term preservation policy. Zenodo supports managed access (with an access request workflow) as well as embargoing generally and during peer review. The base infrastructure of Zenodo is provided by CERN, a non-profit IGO. Projects are funded through grants. | Data sources | Standards/Databases Training |
COVID Epistat | Dashboard for COVID-19 epidemiological data (vaccination, laboratory testing, wastewater, variants, hospitalised patients, mortality, nursing home patients, mental health indicators, seroprevalence). |
||
DMPonline.be | Tool that provides templates for data management plans.
DMP Online
|
||
Epidemiology of Infectious Diseases (Epistat) | A web based application for visualising and exploring data on infectious diseases monitored by Sciensano. |
||
Figures on notifiable infectious diseases | Dashboard for notifiable infectious diseases in Flanders. |
||
Galaxy Belgium | Galaxy Belgium is a Galaxy instance managed by the Belgian ELIXIR node, funded by the Flemish government, which utilises infrastructure provided by the Flemish Supercomputer Center (VSC).
Galaxy
|
||
R Markdown | Tool at Sciensano for generating weekly and daily reports. |
||
Sciensano LimeSurvey | Tool for online surveys.
LimeSurvey
|
||
Sciensano R Shiny Apps | Shiny is an R package that makes it easy to build interactive web apps straight from R. At Sciensano, several Shiny Apps have been developed to process, analyse and visualise data during the COVID-19 crisis. These include the Surge App (monitoring and data quality of COVID-19 hospitalisations), Indicator App (COVID-19 indicators based on test positivity rates per province), Coverage App (coverage of clinical database on hospitalised patients), Hospital Indicators (forecasting and profile of hospitalised patients), and Quality of reporting (quality indicators of reporting by individual hospitals).
R Shiny
|
||
Clinical-Epidemiological (CE) dataset from an Erasmus MC COVID-19 cohort | Clinical-Epidemiological (CE) data from the Erasmus MC cohort includes 151 PCR-confirmed COVID-19 individuals who were admitted to the hospital with a respiratory infection or respiratory failure in 2020-2021. |
||
COVID RADAR app | Over a period of approximately 2 years (starting May 2020 until February 2022), a total of 280.000 Dutch inhabitants filled in a short questionnaire about Corona-related symptoms, behaviour, vaccination status and test result. |
||
COVID-NL clinical data dashboard | The Dutch national COVID-19 clinical data dashboard allows exploration and reuse of clinical data from Dutch university medical centers (UMCs). The dashboard provides researchers with a clear overview of what is available, allows searching for specific data and makes access to such data easier when the necessary ethical and legal conditions have been met. The policy for access to and sharing of clinical COVID-19 is described in the HRI COVID policy document. |
Standards/Databases | |
COVID-NL metadata portal | The Dutch national COVID-19 metadata portal describes the content of the collections and type of data. The underlying data remains at the source, but where possible a link to the data or the data request procedure are provided on the portal. The first health care data sets in the portal are coming from observational studies funded by ZonMw, NFU COVID-19 clinical research data, collaborating top clinical hospitals (STZ), as well as other regional hospitals. However, the portal is open to any health care provider wishing to make their COVID-19 data available for research. |
Standards/Databases | |
DANS | The Data Archiving and Networked Services (DANS) is the Dutch national centre of expertise and repository for research data. |
||
Erasmus MC COVID-19 cohort-associated connected datasets study | As part of the multidisciplinary ReCoDID consortium, the study aimed at connecting clinical-epidemiological (CE) data with further datasets from research on many other aspects of SARS-CoV-2. |
||
ODISSEI | The Open Data Infrastructure for Social Science and Economic Innovations (ODISSEI) is the national research infrastructure for the social sciences in the Netherlands. |
Standards/Databases | |
FEGA Norway | Federated European Genome-phenome Archive (EGA) node |
||
Feide | The Norwegian portal for secure access and data sharing in education and research. |
||
Folkehelseinstituttet (FHI) | Norwegian Institute of Public Health (NIPH) portal for infectious disease information |
||
Helsedirektoratet | Norwegian directorate of health |
||
Meldingssystem for smittsomme sykdommer (MSIS-forskriften) | Norwegian Surveillance System for Communicable Diseases (including infectious diseases) |
||
Nasjonalt senter for e-helseforskning | Norwegian Centre for e-Health Research |
||
NeLS | Norwegian e-infrastructure for Life Sciences is a national portal for sharing, analysis, and mid-term storage of non-sensitive molecular life science data. |
Tool info | |
Norway’s Tool Assembly about Norwegian e-Infrastructure for Life Sciences (NeLS) | Tool Assembly RDMkit Page on NeLS |
Training | |
Norway’s Tool Assembly about sensitive data service (TSD) | Tool Assembly RDMkit Page on TSD |
Training | |
RDMkit’s national resources page for Norway | RDMkit page on Norway’s RDM guidelines and resources for life scientists |
||
SARS-CoV-2 Database | Norwegian SARS-CoV-2 database |
Standards/Databases | |
Tjenester for Sensitive Data (TSD) | Services for sensitive data, a platform for researchers to collect, store, and analyze sensitive research data |
||
usegalaxy.no | Instance of Galaxy for users in Norway, usegalaxy.no is linked with NeLS. |
Training | |
FEGA Norway | Federated European Genome-phenome Archive (EGA) node
European Genome-phenome Archive (EGA)
|
||
Folkehelseinstituttet (FHI) | Norwegian Institute of Public Health (NIPH) portal for infectious disease information |
||
SARS-CoV-2 Database | Norwegian SARS-CoV-2 database |
Standards/Databases | |
COVID-19 & Pandemic Preparedness Data Portal Sweden | The Swedish COVID-19 & Pandemic Preparedness Data Portal is a hub for data, tools, services, and other resources centred around COVID-19 and pandemic preparedness in Sweden. |