White paper: Smart Transformation of Clinical & Nonclinical Data for Insights

Disparate data with inconsistent data models, terminologies, and unstructured descriptions from studies need to be ingested into a searchable data store. Smart transformation replaces fixed adaptors and mappers as an important part of curation to make study data searchable across studies to gain insights.

Smart transformation uses machine learning to transform clinical, nonclinical and biomarker data from data lakes to a target model with automation. Supervised, expertly curated datasets train multiple deep neural network models that transform disparate source data. Recommendation engines using ontologies and vocabularies referenced in the target data model definition harmonize the transformed data. The smart transformers continually improve, learn and adaptively evolve as data managers intervene, assert or correct errors in transformation or users make decisions on metadata, content and terminology recommendations. This artificial intelligence augmented automation promotes data normalization and harmonization for search analytics as well as for regulatory packaging of eData.