Thank you very much @ehutchins and @hirotaka! Your answers were very clarifying and explanatory, and I was able to see that there are more details that I still need to learn about patient designation. I’ll take advantage of my response to confirm whether I’ve understood correctly what you told us and to summarize the steps. If you notice any mistakes or have additional information, just let me know!
After all, how should I recognize each individual person in the PPMI?
1 - Go to LONI where the PPMI data is and download all the files available under ‘START_HERE’
2 - Download any data you deem necessary and want to analyze that contains the PATNO of the participants
3 - Use some technique to join databases based on PATNO using the (1) database that you want to analyze and the (2) specific spreadsheet of your interest present in PPMI_Consensus_Committee_Analytic_Datasets_08May2023 (downloaded in START_HERE - for example: using the spreadsheet of patients with PD)
4 - If you are analyzing PD, for example: If you aim to select patients with clinically manifest PD regardless of their genetic status, use the CONPD column for this. If you want to choose patients with idiopathic PD, use this same column but exclude individuals who also have positive results (1) in other genetic columns ([‘CONLRRK2’, ‘CONGBA’, ‘CONSNCA’, ‘CONPRKN’, ‘CONPINK1’])
More information can be found in the document called Guide to the PPMI analytic dataset (also present in START_HERE).
Specific Questions:
1 - The document PPMI_Consensus_Committee_Analytic_Datasets_08May2023 mentions there being 788 idiopathic PD, however, after selecting all patients with CONPD = 1 and excluding patients with a value of 1 in the genetic columns described above, I ended up with 791 patients - why did I have 3 more than described?
2 - From what I could see, the variables in Primary_Clinical_Diagnosis represent a clinical diagnosis given by the local investigator at the time he evaluated the patient, at each follow-up. It seems that the PPMI committee reviews each case where there is diagnostic doubt and reaches a consensus if the patient actually has PD. The number of suspicions of another diagnosis ends up being much higher than the change that the PPMI committee actually concludes, which, according to the document, were only 2. Attached, I leave a preliminary analysis with the latest available data for each patient regarding their “Primary Clinical Diagnosis” for you to see that a very small amount of these alternative diagnoses are considered.

I took the opportunity to combine the data from patients defined as without PD in PPMI_Consensus_Committee_Analytic_Datasets_08May2023 and who were not part of the genetic group with that of Primary Clinical Diagnosis, obtaining the following result:

My opinion: The use of Primary Clinical Diagnosis to explore differences between PD patients and those with assumptions of alternative diagnoses is possible, but the PPMI committee most often does not agree with the change of diagnosis, making this analysis, therefore, very exploratory.
Also, an additionnal question: is there any problem in revealing participants PATNO in my posts in this forum? Does it goes against some data sharing rule?