GP2 6th Data Release

jgottesman · February 28, 2024, 10:03pm

I’m sure many folks have seen that the latest round of GP2 data came out a few weeks ago in collaboration with AMP PD: The Components of GP2’s Sixth Data Release - GP2.

Some highlights:

The complex disease data (genotypes), including locally-restricted samples, now consists of a total of 44,831 genotyped participants (24,709 PD cases, 17,246 Controls, and 2,876 ‘Other’ phenotypes)
The monogenic disease data (whole genome sequences) now consists of a total of 2,324 sequenced participants (1,854 PD cases, 314 Controls, and 156 ‘Other’ phenotypes)
12,585 individuals who have deep clinical phenotyping information also have matching genetic information
Additional complex disease (genotyped) and monogenic disease (whole genome) samples
Introducing locally-restricted GDPR samples via the Verily Viewpoint Workbench
Introducing clinical data for ~12,000 individuals
Introducing a new ancestry group → Complex Admixture History (CAH)
Updates in quality control measures for released genotyping data
Updates in variant calling, now with DeepVariant, for released whole genome data

Wondering if anyone is planning to or is already in the process of using data from this new release? If so, would love to hear what you are (or are going to be) working on!

lmackenzie · February 29, 2024, 12:45am

@paularp @psaffie @johanna.junker @joanne.trinh @la.lange Tagging you all as I know you work extensively with GP2!

Are any of you working with data from the new release? Or do you have plans to?

psaffie · February 29, 2024, 1:21am

Yes! I am working in some projects with this data. With a Hackaton project and other GP2 projects that I am updating with this data (Multiancestry PRS) and some pilot studies

joanne.trinh · February 29, 2024, 9:19am

Dear Josh, Thanks for this extensive update on GP2 data. Just to add even more information… there are multiple projects in parallel involving trainees and PIs across the globe. Project proposal applications are also open in GP2 and if anyone has ideas to work on the data it is possible to do so.

paularp · March 1, 2024, 9:28pm

Thanks for sharing this summary!
We are already updating some analysis, but my team is most excited about the addition of metadata, such as IBD! Opening new research avenues!

rooparajan · March 7, 2024, 2:08pm

Look forward to exploring this data, especially the diverse/ complex admixture history cohorts.

jgottesman · March 7, 2024, 2:09pm

Glad to see so much interest! @joanne.trinh , do you have a link to the project proposal opportunity you’re referring to? I know there is the general funding/opportunities page on GP2 Opportunities - GP2 but not sure if this is it?

psaffie · April 6, 2024, 5:32pm

Just wanted to let you guys that release 7 is about to come. As soon as it does, we will keep you posted

Topic		Replies	Views
GP2 10th Data Release Data Sharing and Publications genetic-data , data-sharing , gp2	3	16	July 18, 2025
GP2 8th Data Release Data Sharing and Publications genetic-data , data-sharing , data-release , gp2	2	35	November 12, 2024
Global Parkinson's Genetics Program (GP2) Accessing and Understanding Data genetic-data , data-sharing , data-access , data-analysis , gp2	1	38	November 17, 2023
GP2 for beginners Ideas and Inspiration genetic-data , how-to , gp2	0	34	January 31, 2025
Hackathons: GP2, NCBI, others Ideas and Inspiration genetic-data , data-sharing , meta , data-access , data-analysis	6	46	February 6, 2024

GP2 6th Data Release

Related topics