Projects
Some software I am involved in:
- akimbo: analysis of nested, non-tabular data in dataframe workflows
- Intake: a broad library for data ingest and cataloguing
- fsspec: unifying python file-system interfaces
- fastparquet : a python library for efficient, fast tabular data storage and retrieval
- dask and distributed : flexible parallel computation engine
- streamz : real-time low-latency event processing logic and integrations
- zarr : simple but effective n-d array storage with custom copressions for
cloud access
(older)
- anaconda navigator : desktop front-end to the anaconda open data science distribution, in pyqt
- anaconda repo and AEN : enterprise-level solutions for library and data distribution and collaboration
- ADaM and its predecessor anaconda-cluster : create a big data cluster and manage tools remotely
Group affiliations
- pangeo
- pangeo-forge
- conda-forge
- software and data carpentry