Publication | IEEE Transactions on Visualization and Computer Graphics 2018
PhenoLines
Phenotype Comparison Visualizations for Disease Subtyping via Topic Models
Abstract
PhenoLines: Phenotype Comparison Visualizations for Disease Subtyping via Topic Models
Michael Glueck, Mahdi Pakdaman Naeini, Finale Doshi-Velez, Fanny Chevalier, Azam Khan, Daniel Wigdor, Michael Brudno
IEEE Transactions on Visualization and Computer Graphics 2018
PhenoLines is a visual analysis tool for the interpretation of disease subtypes, derived from the application of topic modelsto clinical data. Topic models enable one to mine cross-sectional patient comorbidity data (e.g., electronic health records) and constructdisease subtypes—each with its own temporally evolving prevalence and co-occurrence of phenotypes—without requiring alignedlongitudinal phenotype data for all patients. However, the dimensionality of topic models makes interpretation challenging, and defacto analyses provide little intuition regarding phenotype relevance or phenotype interrelationships. PhenoLines enables one tocompare phenotype prevalence within and across disease subtype topics, thus supporting subtype characterization, a task that involvesidentifying a proposed subtype’s dominant phenotypes, ages of effect, and clinical validity. We contribute a data transformation workflowthat employs the Human Phenotype Ontology to hierarchically organize phenotypes and aggregate the evolving probabilities producedby topic models. We introduce a novel measure of phenotype relevance that can be used to simplify the resulting topology. The designof PhenoLines was motivated by formative interviews with machine learning and clinical experts. We describe the collaborative designprocess, distill high-level tasks, and report on initial evaluations with machine learning experts and a medical domain expert. Theseresults suggest that PhenoLines demonstrates promising approaches to support the characterization and optimization of topic models.
Download publicationRelated Resources
2016
Embedded sensors and feedback loops for iterative improvement in design synthesis for additive manufacturingDesign problems are complex and not well-defined in the early stages…
2011
Leveraging Cloud Computing and High Performance Computing Advances for Next-generation Architecture, Urban Design and Construction ProjectsArchitecture and urban design projects are constantly breaking…
2018
Investigating How Online Help and Learning Resources Support Children’s Use of 3D Design Software3D design software is increasingly available to children through…
Get in touch
Something pique your interest? Get in touch if you’d like to learn more about Autodesk Research, our projects, people, and potential collaboration opportunities.
Contact us