Bioinformatics
Datasets commonly used by bioinformatics domains
Alphafold databases
Colabfold databases
dfam
infoOpen collection of Transposable Element DNA sequence alignments, hidden Markov Models (HMMs), consensus sequences, and genome annotations.
folder_open
/datasets/bio/dfam/
Eggnog
infoA database of orthology relationships, functional annotation, and gene evolutionary histories.
folder_open
/datasets/bio/eggnog-data/
folder_open
/datasets/bio/eggnog6-data/
NCBI NT, NR, Eukaryotic, and Prokaryote databases
infoNCBI’s databases are downloaded weekly. See the full details for more information.
folder_open
/datasets/bio/ncbi-db/
Tara Oceans
infoTara Oceans
folder_open
/datasets/bio/tara-oceans/MGT-transcriptomes/
folder_open
/datasets/bio/tara-oceans/MATOU-gene-catalog/