Hi everyone!
Jody Hesch here from Colorado Springs (originally Albuquerque, NM - with various years spent abroad in the UK, Germany and Australia).
New to PD as of a year ago. I’m a freelance Data Engineer/Architect by profession, and a good friend invited me to help out the startup he’s working at (Alkahest - glad to find several colleagues here as well!).
I’m hoping to be of some assistance to folks here for all things related to data engineering / data modeling / data management. I’m no expert in PD nor relevant fields (i.e. bioinformatics, genetics), but I’ve worked with many, many data science and data analyst teams over the years, helping take a lot of the heavy-lifting of data wrangling/munging off their plates so that they can focus on the analytics themselves.
So if there’s anything folks need help with when it comes to all of the various data plumbing efforts on large projects - data ingestion, data modeling, architecture, DevOps, etc. - let me know!
Happy to help demystify things like data lakes vs. lakehouses vs. warehouses, SQL jiu jitsu, tradeoffs of different platforms (Postgres? Snowflake? Databricks?) etc.
As for PD-specific needs, I spent a fair amount of time over the past year familiarizing myself with both proteomics and RWD (multi-terabyte) datasets, but ofc the fundamentals of data engineering easily extend to any types of data.
Great to be here, looking forward to connecting with folks!
Cheers - and Happy New Year’s (almost)!
Jody