Nucleotide and Protein Sequences

  • Genbank - Nucleotide sequence database at NCBI
  • ENA - European Nucleotide Archive at EMBL-EBI
  • DDBJ - DNA Data Bank of Japan
  • Protein database at NCBI
  • UniProt - a resource of protein sequence and functional information

Genome Downloads

Genome Browsers

Gene Databases

  • Gene - a gene database at NCBI
  • UniGene - a transcript database at NCBI
  • SOURCE - a gene database
  • GeneAtlas - a gene database
  • GeneCards - human gene database
  • RefGene - a gene database with comprehensive gene-related information
  • OMIM - An Online Catalog of Human Genes and Genetic Disorders
  • AmiGo - a gene ontology database
  • HGNC - HUGO Gene Nomenclature Committee
  • GENCODE - human and mouse gene annotation database

Gene Mutations and Variations

  • dbSNP - a database of single nucleotide polymorphisms (SNPs) and small-scale insertions/deletions, microsatellites, and non-polymorphic variants.
  • MutDB - a database for assessing the impact of genetic variants
  • HGMD - The Human Gene Mutation Database
  • COSMIC - the Catalogue Of Somatic Mutations In Cancer
  • GWAS Central - a centralized compilation of summary level findings from genetic association studies
  • DGV - a curated catalogue of human genomic structural variation


Gene Regulatory Elements

  • TRED - Transcriptional Regulatory Element Database
  • EPD - Eukaryotic Promoter Database
  • JASPAR - transcription factor binding profile database
  • ENCODE - Encyclopedia of DNA Elements
  • TRRUST - a manually curated database of human transcriptional regulatory network
  • FANTOM5 promoterome
  • Cistrome - integrative analysis pipelines to better mine the hidden biological insights from publicly available high throughput data
  • ChIP-Atlas - integrative and comprehensive database for visualizing and making use of public ChIP-seq data
  • GTRD - Gene Transcription Regulation Database
  • ChIPBase - a database for studying the transcription factor binding sites and motifs, and decoding the transcriptional regulatory networks of lncRNAs, miRNAs, other ncRNAs and protein-coding genes from ChIP-seq data

Non-Coding RNA Databases and Tools

  • miRBase - a searchable database of published miRNA sequences and annotation
  • miRTarBase - the experimentally validated microRNA-target interactions database
  • miRDB - online database for miRNA target prediction and functional annotations
  • TargetScan - prediction of microRNA targets
  • RNA22 - microRNA target detection
  • DIANA Tools - databases of experimentally verified miRNA targets and prediction tools

Protein Function and Regulation

  • KEGG PATHWAY Database
  • NetPath - manually curated resource of signal transduction pathways in humans
  • Reactome - a signaling pathway database
  • WikiPathways - a database of biological pathways
  • Phospho.ELM - phosphorylation site database
  • PHOSIDA - phosphorylation site database
  • dbPAF - database of phospho-sites in animals and fungi
  • PhosphoNET - human phosphosite knowledgebase
  • IntAct - a protein interaction database at EBI
  • BioGRID - a repository for interaction datasets
  • STRING - known and predicted protein-protein interactions
  • PaxDb - Protein Abundance Database

Structure Databases

Other Resources


Sequence Alignment

Primer Design

siRNA/shRNA Design


RNA Secondary Structure Prediction

Functional Genomics

  • DAVID - a comprehensive set of functional annotation tools
  • Enrichment analysis at the Gene Ontology Consortium web site
  • GSEA - Gene Set Enrichment Analysis
  • Enrichr - interactive and collaborative gene list enrichment analysis tool
  • WebGestalt - WEB-based GEne SeT AnaLysis Toolkit
  • PANTHER - Protein ANalysis THrough Evolutionary Relationships
  • GeneMANIA - biological network integration for gene prioritization and predicting gene function
  • InterMine - open source data warehouse system for the integration and analysis of biological data. Instances of InterMine automatically provide enrichment analysis for uploaded sets of genes

Promoter Analysis

Protein Analysis

  • ProtParam - protein physical and chemical parameters
  • SAPS - statistical analysis of protein sequences
  • NetPhos 3.1 - predicts serine, threonine or tyrosine phosphorylation sites in eukaryotic proteins
  • KinasePhos2.0 - phosphorylation site prediction tool
  • ProteinGuru - Global e-Utility and Resource Unit for Protein Research
  • GPMAV lite - protein physical and chemical parameters

Multiple Tools Websites


In Silico Molecular Cloning

Genomics, Transcriptomics, NGS Data

Just a few examples. For comprehensive listings use dedicated websites, e. g. here.

Graphical user interface

  • IGV - Integrative Genomics Viewer (NGS data visualization on genomes and more)
  • Artemis - a free genome browser and annotation tool
  • AltAnalyze - end-to-end analysis of single-cell and bulk RNA-Seq data
  • FunRich - software tool used mainly for functional enrichment and interaction network analysis of genes and proteins

Command line interface

  • Samtools - a suite of programs for handling high-throughput sequencing data files
  • Cutadapt - removes adapter sequences, primers, poly-A tails and other types of unwanted sequence from high-throughput sequencing reads
  • STAR - ultrafast universal RNA-seq aligner
  • HISAT2 - a fast and sensitive alignment program for mapping next-generation sequencing reads (both DNA and RNA) to a population of human genomes (as well as to a single reference genome)
  • BBTools - a suite of fast, multithreaded bioinformatics tools designed for analysis of DNA and RNA sequence data
  • GeneSCF - Gene Set Clustering based on Functional annotation

Cloud-based, with web browser interface

  • Galaxy - web-based platform for data intensive biomedical research
  • GenomeSpace - interoperability framework to support integrative genomics analysis through an easy-to-use Web interface
  • GenePattern - provides hundreds of analytical tools for the analysis of gene expression (RNA-seq and microarray), sequence variation and copy number, proteomic, flow cytometry, and network analysis
  • WebMeV - a cloud-based application supporting analysis, visualization, and stratification of large genomic data, particularly for RNA-Seq and microarray data
  • BaseSpace - cloud-based genomics analysis and storage platform that directly integrates with all Illumina sequencers

Biological Systems Modeling

  • Cytoscape - software platform for visualizing complex networks
  • NetWalker - network analysis suite for functional genomics
  • PathVisio - a tool to edit and analyze biological pathways
  • CellDesigner - a modeling tool of biochemical networks
  • COPASI - simulation and analysis of biochemical networks and their dynamics
  • The Virtual Cell - virtual cell modeling & analysis

Statistical Data Analysis

  • R - a free software environment for statistical computing and graphics
  • PSPP - a program for statistical analysis of sampled data
  • JASP - a free alternative to SPSS
  • Past - free software for scientific data analysis
  • Orange - open source data visualization and data analysis
  • WinBUGS - flexible software for the Bayesian analysis of complex statistical models
  • JAGS - a program for analysis of Bayesian hierarchical models using Markov Chain Monte Carlo simulation
  • SHOGUN - a large scale machine learning toolbox
  • ADaMSoft - a free system for data management, data and web mining, statistical analysis and more

Image Analysis and Graphics

  • Fiji - open source image analysis software package
  • Cell Profiler - open-source software for quantitative analysis of biological images
  • CellTracker - an image processing software to perform automated, semi-automated, and manual cell migration detection
  • GIMP - GNU Image Manipulation Program
  • Inkscape - open-source professional vector graphics editor for Windows, Mac OS X and Linux


Repositories and Vendors

Maps and Sequences


Your browser does not support SVGs