THL Biobank published datasets
THL Biobank is offering access to several datasets used in recent scientific publications utilizing its research collections. The datasets currently include the following projects: The National FINRISK Study’s microbiome project and the Twin Study’s RNA sequencing and methylation data.
These datasets are offered as as used in the analysis of the publications. If you want to combine the data with any samples/data available in the biobank or to link with any additional data from national registers, please submit the application through our standard application process.
THL Biobank application process
Dataset ID and publication | N of donors | Cohort | Data types | Dataset description, 'read-me-first' file |
---|---|---|---|---|
THLBB2020_PUB001 Palmu et al. |
6953 | FINRISK | metagenomic data; blood pressure treatment; background info** | FINRISK 2002 microbiome and blood pressure (pdf 198 kb) |
THLBB2020_PUB002* Alvarez et al. |
6 | Twin Study | single nuclei RNAseq (adipose tissue) | Single nuclei RNA sequencing (pdf 220 kb) |
THLBB2021_PUB001* van der Kolk et al. |
98 | Twin Study | RNAseq (adipose tissue, skeletal muscle) | RNAseq (pdf 171 kb) |
THLBB2021_PUB002 Sillanpää et al. |
123 | Twin Study | DNA methylation; sports, work and leisure index; background info | Epigenetic clocks (pdf 125 kb) |
THLBB2021_PUB003 Salosensaari et al. |
7055 | FINRISK | metagenomic data; mortality; background info** | Taxonomic signatures of cause-specific mortality risk in human gut microbiome (pdf 118 kb) |
THLBB2021_PUB004 Ruuskanen et al. |
6269 | FINRISK | metagenomic data; background info** | Gut microbiome composition and fatty liver disease (pdf 148 kb) |
THLBB2021_PUB005 Koponen et al. |
4930 | FINRISK | metagenomic data; dietary data; background info** | Associations of healthy food choices with gut microbiota profiles (pdf 132 kb) |
THLBB2021_PUB006 van Dongen et al. |
2216 | Twin Study | raw DNA methylation intensity data; methylation beta-values; background info | Epigenetic signature of early genome programming in identical twins (pdf 141 kb) |
THLBB2021_PUB007 Kankaanpää et al. |
2240 | Twin Study | Processed DNA methylation data; sports, work and leisure index; background info | Epigenetic clocks and lifespan sex differences (pdf 144 kb) |
* Please note that this dataset is given as such and cannot be combined with any other data.
** Please note that this dataset requires in addition to biobank permission a separate permission from Finnish Social and Health Data Permit Authority, Findata.
Accessing published datasets
The datasets hosted by THL Biobank contain sensitive data related to biobank sample donors, and therefore we apply restrictions on their use. The permission to access the datasets as such is applied through THL Biobank Application Portal REMS (catalogue item: "5. OTHER COLLECTION: THL Biobank published datasets") and will be charged according to biobank's service prices. Once your access request is approved in the biobank, we will send you a data transfer agreement that should be signed by your home organization. Access to the data is provided once the agreement is signed.
General terms of access to THL Biobank resources
THL Biobank Application Portal
THL Biobank Service Prices
Certain datasets contain also variables derived from national health register data. Permission to access register-based data requires project specific permission from Finnish Social and Health Data Permit Authority, Findata. Feel free to contact THL Biobank if you need help with the Findata application.
Findata
Please note that the information on individuals whose biobank consent is no longer valid will be removed from the provided dataset, if necessary.
Contact
admin.biobank ( at ) thl.fi