class: left, name:opening # NeuroData Joshua T. Vogelstein & Vikram Chandrashekhar
.foot[[jovo@jhu.edu](mailto:jovo at jhu dot edu) |
] --- class: center ###
What is NeuroData?
###
Goal: Build tools that work on .r[your] data
--- ### Why is it hard? - classical statistic theory .r[requires]: - p dimensions is smaller than n samples - your data has .r[p >> n] - recent statistical theory for p >> n, - but with .r[inaccurate assumptions] for your data - those methods work poorly on your data - we develop methods that: - statistically valid - computationally efficient - for your data --- ### Some Examples 1. Hypothesis Testing 2. Dimensionality Reduction 3. Classification 3. Connectome Coding 4. CLARITY & COLM For 1-3, we have strong theory that I can discuss ad nauseam --- ### Hypothesis Testing ##### Case A: Test whether X is independent of Y X is CLARITY brain, and Y is condition (eg, control vs. addicted) -- ##### Case B: Test whether population A is the same as population B Pop A are control CLARITY brains, Pop B are addicted CLARITY brains.
(this is akin to, and a generalization of, a t-test) -- #### Main Idea .r[MGC]: Compare "correlations" between all .r[local] pairs of points --
ps - like Marr-Albus Sparse Expansion Theory --
pps - can be used for "CCA", eg, behavior vs neural activity --- ### MGC Empirically Dominates
--- ### Dimensionality Reduction Given pairs of high-dimensional X and corresponding class labels Y, find the best low-dimensional representation of X -- #### Main Idea .r[LOL]: Use the means and PCA of .r[each class separately] --- ### LOL Empirically Dominates
--- ### Classification Given pairs of high-dimensional X and corresponding class labels Y, find the best discriminant boundary -- #### Main Idea .r[RerF]: Find many good .r[sparse] projections of data --- ### RerF Empirically Dominates .pull-left[ - random forests (RF) were best - recent extensions improve - we are even better ] .pull-right[
] --- ### Connectome Coding
Characterizing the relationship between the *past environment* and the *present neural connectivity* --
(could be relevant for Callaway and Justus stuff?) --- ### CLARITY / COLM Build comprehensive ecosystem CLARITY/COLM data, including - storage & management - visualization - registration to atlas and one another - cell detection & counting - tractography --- ### Storage & Management
infinitely scalable in cloud --- ### Visualization .pull-left[ - Google's NeuroGlancer' - 3D pan, zoom, & rotate - Multi-channel overlays - Select individual ROIs - 30 minutes to load once ] .pull-right[
] --- ### Registration
- only ~1 hr per brain - fully automatic (no landmarks) - works on iDisco and other species too --- ### Cell Detection
--- ### Cell Detection Analysis
--- ### Tractography
--- class: middle #
[demo](https://tinyurl.com/yco8h787)
--- class: top, left ### Acknowledgements
Carey Priebe
Randal Burns
Michael Miller
Brian Caffo
Michael Milham
Daniel Tward
Minh Tang
Avanti Athreya
Vince Lyzinski
Daniel Sussman
Yichen Qin
Youngser Park
Cencheng Shen
Shangsi Wang
Greg Kiar
Eric Bridgeford
Vikram Chandrashekhar
Tyler Tomita
James Brown
Disa Mhembere
Drishti Mannan
Jesse Patsolic
Benjamin Falk
Kwame Kutten
Eric Perlman
--- ### Questions? Now hiring! .pull-left[ | task | link | | --- | --- | | testing | [MGC](https://github.com/neurodata/mgc) | | dim red | [LOL](https://github.com/neurodata/LOL) | | classify | [RerF](https://github.com/neurodata/R-RerF/) | | conn code | [GraphStats](https://github.com/neurodata/graphstats/) | CLARITY | [COBALT](https://github.com/neurodata/COBALT) | | email | [jovo@jhu.edu](mailto:jovo@jhu.edu) | | web | [neurodata.io](http://neurodata.io/) | | startup | [gigantum.com](http://gigantum.com/) |
♥, 🦁, 👪, 🌎, 🌌
]