Source Code & Apps
R Library Code
Clustering and cluster optimization algorithms
Purpose: Identifying the ideal number of groups (e.g., cancer patients, cells, social networking groups).
Benefits: Computationally efficient. Conserves on the number of samples needed. Compatible with multiple clustering approaches.
Current applications: Designing clinical trials. Identifying the effectiveness of human cell patterning.
Availability: As of Nov 2015, Progeny Clustering code package is available open source in the R repository: “progenyClust: Finding the Optimal Cluster Number Using Progeny Clustering”, as described in (Hu & Qutub, R Journal 2016). A user-friendly outline of the progenyClust R package is available online here thanks to Ian Howson.
Whole tissue metabolic models
CORDA2 library (Pacific Symposium on Biocomputing, 2017)
Purpose: Provides a method to produce cell and tissue-specific metabolic models
Benefits: Predicts metabolism in healthy and diseased cells based on experimentally-obtained molecular expression data. Faster than CORDA and noise-independent. Optimized for comparing subtypes of human tissues and cells.
Current applications: Identifying differences in metabolism within subtypes of cancer and healthy mammalian cells. Generating patient-specific models.
Library availability: MATLAB formats.
MATLAB: CORDA2 function file as provided in the paper’s supplemental information.
Matrix-Form Artificially Centered Hit and Run, mfACHR (Pacific Symposium on Biocomputing, 2017)
Purpose: Provides a method to improve upon Monte Carlo Sampling of high dimensional network models
Benefits: Speeds up sampling of a model’s solution space compared to prior algorithms.
Current applications: Identifying steady-state metabolic flux distributions within cell and patient-specific whole genome wide metabolic models.
MATLAB: mfACHR function file as provided in the paper’s supplemental information.
CORDA library (PLOS Comp Bio, 2016)
Purpose: Provides a library of tissue specific metabolic models
Benefits: Predicts metabolism in healthy and diseased tissues based on experimentally-obtained molecular expression data
Current applications: Identifying differences in metabolism across cancerous and healthy mammalian tissue
MATLAB: CORDA function file as provided in the paper’s supplemental information.
Python: Python version of CORDA developed by Christian Diener.
corsoFBA (BMC Systems Bio, 2015)
Purpose: Models the flux of metabolites through tissue.
Benefits: Overcomes biomass production assumptions of other FBA methods.
Current applications: Modeling the metabolism of mammalian cells, and changes in disease.
Protein signaling pathway models
HIF1 hydroxylation chemical-kinetic models (2006-2008)
Purpose: Models post-translational regulation of hypoxia-inducible factor 1 (HIF1), a protein activating hundreds of genes as a function of oxygen availability
Benefits: Predicts levels of HIF1 as a function of its cofactors
Current applications: Helping optimize the design of experimental modulation of hypoxia-inducible factor proteins in cancer and neural progenitor cells
Availability: papers & m-files
Purpose: Characterizes multicellular topology from microscopy images. Accessible via Amazon cloud, cytoNet takes as input color or binary images of cells or tissues, and quantifies the spatial relationships in cell communities using principles of graph theory.
Benefits: cytoNet identifies effects of cell-cell interactions on individual cell phenotypes.
Current applications: Understanding cell cycle dynamics in developing neural stem cells, characterizing the response of endothelial cells to neurotrophic factors present in the brain after injury
Availability: cytoNet site
GAIN Neuron Counting App (J Neurosci Methods, 2017)
Purpose: Counting neural progenitor cells and neurons, and measuring dendrite outgrowth from individual cell bodies in images
Benefits: Overcomes limitations of applying prior algorithms to single cell analysis by a rule-based tracking method that maps soma to their dendrites. Interactive friendly graphical user-interface is integrated with the code. For use by all researchers.
Current applications: Automatically processing neuronal cell microscope images to help determine differentiation state of cells in different mechanical environments and in response to drugs.
Interactive Data Visualization Software
Purpose: Visually interpret high-dimensional data through interactive graphs
Benefits: Ease of use & speed. No programming needed. Drag & drop files into the cloud-based tool. Collaborate on biomedical data science projects.
Current applications: Collaboration on data challenges. Expert-informed learning. Teaching tool for clustering and pattern recognition. Clinical decision-making.
Biowheel Clinical & Proteomics Visualization Portal for Leukemia Data (PLOS Comp Bio, 2016) – Open Data Portal
Biowheel Breast Cancer Proteomics Visualization Portal (Nature Methods, 2016) – Open Data Portal