Proteomic models to predict pre-analytical variation


In biomarker discovery, it is critical to assess any pre-analytical variation (PAV) in order to avoid artificial bias in the intended measurements. PAV may arise from both avoidable and unavoidable factors, resulting in misleading data and incorrect conclusions. Proteins, in particular, are vulnerable to variation in collection methods, storage temperatures, and processing protocols. It is vitally important to understand this PAV when analyzing samples using protein assays.


Human EDTA plasma and serum samples, subjected to standardized sample processing methods, with distinct excursions from ideal collection, were assayed on the SomaScan® Platform measuring ~7,000 analytes. Using machine-learning methods, these quantitative protein measurements were compared to sample processing truth standards (eg, time-to-spin) to create predictive models. These models, termed SomaSignal® tests (SSTs), were developed to enable the assessment of PAV related to processing methods.


SomaSignal tests (SSTs) have been developed to predict time-to-spin, time-to-decant and time-to-freeze, reported in the number of hours, for both plasma and serum. Models that predict the number for freeze/thaws a sample has been subjected to, have also been developed. All eight models had Lin’s CCC and R2 values greater than 0.90 in hold-out validation datasets. In addition to these sample handling predictions, effect size calculations for all ~7,000 measurements have been determined for multiple time points, or freeze-thaw cycles, for each model.


SomaLogic has developed a unique class of PAV models that are able to assess variation related to processing methods. Results from these predictions can be used during biomarker evaluations to exclude samples due to apparent excessive delay in processing, identify collection site bias for current and future analysis and identify sample groupings that may impact analysis. Further, knowing the effect size metrics for all measurements could also enable the removal of specific analytes from modeling and/or be used as covariates in model development.


David Astling
Dan Drolet
Joe Gogain
Yolanda Hagar
Laura Sampson
Kaitlin Soucie
Kinsey Trinder
Ira von Carlowitz
Matthew Westacott

SomaLogic Operating Co., Inc., Boulder, CO USA

Share with colleagues

More posters

PosterLatest research shows benefit of non-invasive, high-plex protein profiling for liver disease

Learn about using high-plex, aptamer-based protein profiling for NASH research through these three SomaLogic assets. Watch one webinar and download two posters that each highlight NASH research.

Learn more

PosterProteomic Indicators of Metabolic Health in Diabetes and Social Deprivation

Understanding the health impacts of socioeconomic deprivation (SED) and its interaction with type 2 diabetes is important for patient care and effective public health initiatives. Large-scale proteomic profiling using aptamer-based technology to measure 7,000 proteins has facilitated the development of blood-based proteomic signatures for 11 cardiometabolic SomaSignalTM Tests (SST)

Learn more

PosterHeritability, pQTLs, and environmental influence on proteins involved in age, cardiovascular risk, and glucose tolerance using the SomaScan® Assay

Protein quantitative trait locus (“pQTL”) studies identify genetic variants that are statistically associated with protein levels. Results from the growing number of pQTL studies can be combined with genome-wide association studies to identify proteins that underlie the genetic risk of disease, thus revealing the mechanisms of disease and potential drug targets.

Learn more

Explore posters in our interactive viewer