Presented by: Monica Beato Coelho, Rancho BioSciences
Data Loading for Beginners” training session will start by a brief introduction of tM user interface and functionalities such as browse and search datasets loaded, variable types and patient cohort selection, saving queries, summary statistics, grid view and advanced workflow. tranSMART curation workflow will be discussed and we will use sample data (GEO data) to demonstrate how to transform and map dataset into tM navigation tree. Demonstration of the curation process required to load data types supported by tM will include clinical data curation, for which we will highlight the importance of data cleaning and mapping clinical variables using defined ontologies, and how to prepare high dimensional data files using gene expression as an example. After preparing the data files we will show the folder/file structure required to load data using tM ETL tool, and how to use the ETL tool (main commands) and how to troubleshoot data loading issues.