THL Biobank published datasets

THL Biobank is offering access to several datasets used in recent scientific publications utilizing its research collections. The datasets currently include the following projects: The National FINRISK Study’s microbiome project and the Twin Study’s RNA sequencing and methylation data.

These datasets are offered as as used in the analysis of the publications. If you want to combine the data with any samples/data available in the biobank or to link with any additional data from national registers, please submit the application through our standard application process.
THL Biobank application process

Published datasets available in THL Biobank
Dataset ID and publication N of donors Cohort Data types Dataset description, 'read-me-first' file
THLBB2020_PUB001
Palmu et al.
6953 FINRISK metagenomic data; blood pressure treatment; background info** FINRISK 2002 microbiome and blood pressure (pdf 198 kb)
THLBB2020_PUB002*
Alvarez et al.
6 Twin Study single nuclei RNAseq (adipose tissue) Single nuclei RNA sequencing (pdf 220 kb)
THLBB2021_PUB001*
van der Kolk et al.
98 Twin Study RNAseq (adipose tissue, skeletal muscle) RNAseq (pdf 171 kb)
THLBB2021_PUB002
Sillanpää et al.
123 Twin Study DNA methylation; sports, work and leisure index; background info Epigenetic clocks (pdf 125 kb)
THLBB2021_PUB003
Salosensaari et al.
7055 FINRISK metagenomic data; mortality; background info** Taxonomic signatures of cause-specific mortality risk in human gut microbiome (pdf 118 kb)
THLBB2021_PUB004
Ruuskanen et al.
6269 FINRISK metagenomic data; background info** Gut microbiome composition and fatty liver disease (pdf 148 kb)
THLBB2021_PUB005
Koponen et al.
4930 FINRISK metagenomic data; dietary data; background info** Associations of healthy food choices with gut microbiota profiles (pdf 132 kb)
THLBB2021_PUB006
van Dongen et al.
2216 Twin Study raw DNA methylation intensity data; methylation beta-values; background info Epigenetic signature of early genome programming in identical twins (pdf 141 kb)
THLBB2021_PUB007
Kankaanpää et al.
2240 Twin Study Processed DNA methylation data; sports, work and leisure index; background info Epigenetic clocks and lifespan sex differences (pdf 144 kb)

* Please note that this dataset is given as such and cannot be combined with any other data.
** Please note that this dataset requires in addition to biobank permission a separate permission from Finnish Social and Health Data Permit Authority, Findata.

Accessing published datasets

The datasets hosted by THL Biobank contain sensitive data related to biobank sample donors, and therefore we apply restrictions on their use. The permission to access the datasets as such is applied through THL Biobank Application Portal REMS (catalogue item: "5. OTHER COLLECTION: THL Biobank published datasets") and will be charged according to biobank's service prices.  Once your access request is approved in the biobank, we will send you a data transfer agreement that should be signed by your home organization. Access to the data is provided once the agreement is signed.
General terms of access to THL Biobank resources
THL Biobank Application Portal
THL Biobank Service Prices

Certain datasets contain also variables derived from national health register data. Permission to access register-based data requires project specific permission from Finnish Social and Health Data Permit Authority, Findata. Feel free to contact THL Biobank if you need help with the Findata application.
Findata

Please note that the information on individuals whose biobank consent is no longer valid will be removed from the provided dataset, if necessary.

Contact

admin.biobank ( at ) thl.fi