PDMB Sequencing report, potential outliers

Hi again @mattk,

I’m back with a couple more PDMB & sequencing-related questions I was hoping you could help me with.

First, I’m looking to include some sort of sequencing report in my methods. The information in the 2024 data paper was a good start, but other information like what fraction of the reads were human, what was in the controls, and if any samples were removed for reasons other than insufficient read depth (etc) would be appreciated.

Second, in analysing the PDMB stool/gut data I found a group of what look like outliers - 42 samples with much lower sequencing depth than the rest of the samples (median 124k reads vs 1.45m) . This group also seems to have more taxa associated with the oral microbiome than the gut microbiome, as in lots of Streptococcus, Rothia, Neisseria, Actinomyces, etc. Is it possible that these samples were misidentified as stool samples? Moving forward, would you suggest excluding these from analysis? The sample names are below, thanks for your help! - Adam

FOX_928949
FOX_468215
FOX_563676
FOX_337847
FOX_498006
FOX_563567
FOX_489847
181
088
192
FOX_739702
130
201
FOX_808609
242
FOX_963562
FOX_600771
FOX_644064
FOX_712622
FOX_112824
FOX_317663
FOX_677694
FOX_912118
063
FOX_283957
FOX_284498
FOX_589353
FOX_096521
FOX_253293
FOX_499510
FOX_380673
FOX_876367
FOX_917197
FOX_519587
FOX_428825
FOX_741697
FOX_354214
FOX_562209
FOX_489040
243
168
274
2 Likes

Hi Adam (@alemkow),

Thanks for reaching out. I’m taking a look at these questions and will get back to you.

Best,

Matt

3 Likes