The joint data science program has a self-maintained computational cluster capable of processing and storing the large volumes of data produced by the Combinatorial Drug Discovery Program (CDDP), the Center for Advanced Imaging (CAI), Center for Advanced Microscopy and Image Informatics (CAMII), High-throughput Flow Cytometry Program (HtFCP), and the Microphysiological Lead Optimization and Toxicity screening facility (MLOTS) housed at the Texas A&M Institute of Biosciences and Technology. Central to this computational infrastructure is Pipeline Pilot Software (BIOVIA), which provides shared access of CPUs, GPUs, and RAM across decentralized workstations in addition to providing a wide array of software development and data management tools. In total this cluster contains over a terabyte of RAM, 248 logical processors, and 3 high-capacity GPU clusters. The joint data science program also maintains an actively expanding database containing a wide array of pharmacologic susceptibility data from 100s of unique cell line models and cell culture systems, harmonized chemical annotation information for multiple databased, raw annotated images from multiple research programs, and a large code base from applications ranging from basic statistics to advanced deep learning. Collectively, these resources server as a proprietary platform to rapidly develop technologies and uniquely contextualize information being generated by our core facilities.
Statistical services
Rigor and reproducibility analysis
Assay design optimization
Data normalization and integration
Exploratory data analysis
Development of Machine learning/Deep Learning models
Image analysis services
Development of custom image analysis pipelines
Upscaling established protocols for HTS applications
Established protocols for 2D monolayers, organoids, spheroids, and time lapse data analysis