Efficient development of certified diagnostic laboratory developed tests using proteomic data


Proteomic technology is a powerful biological tool with established methods for identifying proteomic biomarkers, but the development of certified diagnostic clinical tests based on proteomic biomarkers can be time-consuming, prone to overfitting issues, and difficult to navigate. We demonstrate the utility of combining pipeline tools, statistical learning techniques, and a knowledge base of in-silico proteomic datasets into a reproducible workflow that allows for efficient development of LDT-certifiable tests using SomaScan® technology.


Data pipeline and analysis tools were developed using R, in conjunction with proteomic measurements obtained using the SomaScan Platform. The tools take the analyst from data processing and QC through identification of optimized models for prediction of clinical endpoints, and then through validation on a hold-out test set. The tools include an assessment of model robustness against sample handling issues, longitudinal stability, the impacts of assay noise on model performance, effects of putative interferents, and risk of failure during CLIA validation in the lab. Real-life examples of clinical applications demonstrate the effectiveness of the tool in reducing analysis time and increasing model accuracy.


Analysis time for identifying the optimal proteomic model to validation was reduced by at least 80%, with decreased prediction variability by up to 90%. In at least 75% of cases, application of in-silico data allows for tuning of predictive models to ensure robustness in a variety of everyday settings. This tool has led to 16 LDT certified SomaLogic tests in the last 3 years, ranging from anthropometric
measurements to cardiovascular- and cancer-risk predictions.


Not only are powerful, proteomics-driven, diagnostic tests realizable, but they can be LDT certified in an efficient, reproducible manner and made to be robust to real-life variability. Efficient analysis tools allow us to leverage proteomic technology in new ways, leading to tests that can be used for precision medicine applications.


Y. Hagar
L.E. Alexander
C. Scheidel
A. Zhang
J. Gogain
C. Paterson
R. Ostroff
M.A. Hinterberg

SomaLogic Operating Co., Inc., Boulder, CO, USA

Share with colleagues

More posters

PosterThe Plasma Proteome as a Cardiovascular Disease Risk Assessment Tool in Cancer Survivors

Cardiovascular disease (CVD) is the most common non-cancer cause of death in cancer survivors and there is an unmet clinical need for easy, accurate, and safe CVD prognostic risk-stratification in adult cancer survivors. This study investigated whether a previously validated 27-plasma protein prognostic model for four-year cardiovascular (CV) events could have such a utility.

Learn more

PosterEfficient development of prognostic tests for detecting cancer risk using proteomic technology

Prognostic models for assessing future health outcomes can be developed using time-to-event (also known as “survival”) data. This methodology is ubiquitous in statistical literature and in the analysis of cancer outcomes, but its use in high-dimensional analyses tends to be limited as the methods are difficult to implement in a machine learning environment. Additionally, development of certified prognostic clinical tests using proteomic biomarkers for detecting future cancer risk can be time-consuming, prone to overfitting issues, and difficult to navigate. We demonstrate the utility of combining SomaScan® proteomic data with pipeline machine learning tools and survival analysis methodology to identify powerful and robust LDT-certifiable prognostic tests for assessing future risk of cancer.

Learn more

PosterPredicting risk of future events in individuals with chronic coronary syndromes

Evaluate whether a previously validated 27-protein prognostic model for four-year cardiovascular event risk can be used to stratify patients with suspected chronic coronary syndrome (CCS)

Learn more

Explore posters in our interactive viewer