class: center, middle name:opening ## Opportunities and Challenges in Big Data Neuroscience
.center[ Joshua T. Vogelstein
{[bme](http://www.bme.jhu.edu/),[icm](http://icm.jhu.edu/),[cis](http://cis.jhu.edu/),[idies](http://idies.jhu.edu/),[kavli](http://kavlijhu.org/),[cs](http://engineering.jhu.edu/computer-science/), [ams](http://engineering.jhu.edu/ams/), [neuro](http://neuroscience.jhu.edu/)} | [jhu](https://www.jhu.edu/)
questions: [jovo@jhu.edu](mailto:jovo at jhu dot edu)
slides:
Co-Founder: [NeuroData](http://neurodata.io) & [gigantum](http://gigantum.io) ] --- ## Introductory Comments
- Please interrupt with questions! - Please come work with us (postdocs & software engineers)! --- ## Outline - NeuroData Challenges - Science 2.0 --- class: center, middle # NeuroData --- ### Challenges
--- class: center, middle # Collect ---
---
---
---
---
--- class: center, middle # Store --- ### Multiresolution Space Filling Curve
--- ## Benchmark
---
---
--- class: center, middle # Wrangle ---
--- class: center, middle # Explore ---
---
---
---
--- class: center, middle # Model ---
---
--- ### Challenges
--- class: center, middle # Science 2.0 --- ## Where are we now?
- Data in the Cloud - Pipelines in the Cloud - Science in the Cloud ---
http://cloud.neurodata.io/nd/ca/kasthuri11cc/xy/2/2056,2856/3296,3896/2/
--- ## Data in the Cloud
Anybody in the world can: - [Visualize](http://viz.neurodata.io/project/Fear199/4/250/250/0/) - [Download](http://cloud.neurodata.io/nd/ca/kasthuri11cc/xy/2/2056,2856/3196,3996/2/) arbitrary subvolumes But not yet easily - Reproduce & extend analysis --- ## Pipelines in the Cloud
--- ## Pipelines in the Cloud
| Requirement | Implementation | | :--- :--- | | Data in Cloud | Amazon S3 bucket | | Data Spec | BIDS | | Notebook | Jupyter | | Virtualization | Docker | | Deployment | Always up | | Cloud Computing | Amazon EC2 | --- ## Pipeline in the Cloud
Anybody in the world can: - [Visualize](http://viz.neurodata.io/) - [Download](http://docs.neurodata.io/ndstore/) arbitrary subvolumes - [Reproduce and extend](scienceinthe.cloud) analysis But not yet quite: - "Continuously integrated" --- ## Science in the Cloud
- Maintain a single "science in the cloud" digital experiment -- - Make "tests" to perform quality control for new data & analyses -- - Every new contribution should be tested before integrated -- - Everything should be able to run locally as well as in the cloud -- - Easy for anybody else to benefit/contribute --- ## Science in the Cloud 1. Globally democratizes science 2. Accelerates discovery
--- # References
- [Science in the Cloud](https://academic.oup.com/gigascience/article-lookup/doi/10.1093/gigascience/gix013)...arXiv, 2016 - [To the Cloud!](https://doi.org/10.1016/j.neuron.2016.10.033)... Neuron, 2016 - [Cosmos to Connectomes](http://www.sciencedirect.com/science/article/pii/S0896627314007466)...Neuron, 2014 - [Open Connectome Project](https://arxiv.org/abs/1306.3543)...SSDBM, 2013 --- ## Acknowledgments | Role | Person | | :--- | :--- | | Data | Bock, Lee, Reid, Kasthuri, Lichtman, Collman, Weiler, Micheva, Smith, Crow, Deisseroth, Bloss, Spruston, Hildebrand, Engert, Harris, Zlatic, Cardona, Wanner | | Theory | Priebe, Lyzinski, Sussman, Tang, Ketcha, Wang, Fishkind, Durante, Dunson, Chen, Shen, Lindquist, Caffo | | Computer Vision | Kazhdan, Dyer, Gray Roncal, Kutten, Miller, Hager, Chevillet, Simhal, Patsolic, Saalfeld, Sapiro | | Computer Science | Zheng, Lillaney, Baden, Manavalan, Mhembere, Perlman, Koutra, Faloutsos, Szalay, Burns | | Analysis | Kiar, Kleissas, Litt, Wandell, Poldrack, Wiener, Vogelstein |
--- class: middle, center # Questions? ## Hiring Postdocs & Software Engineers Now! e: [jovo@jhu.edu](mailto:jovo@jhu.edu) w: [neurodata.io](http://neurodata.io), [gigantum.io](http://gigantum.io)