\ A flurry of Intake drivers! — Martin Durant

Martin Durant

A flurry of Intake drivers!

written by Martin Durant on 2021-02-26

Intake

TL;DR: the Intake plugins directory has expanded recently, and no longer fits on a single page. Someone (read: me) needs to start making categories and a TOC. This bodes well for the continued growth of Intake uptake.

Intake is...

For those that have not come across it before, Intake is a python package for describing, loading and disseminating datasets in catalogues. Newcomers are encouraged to read the main documentation and the quickstart there to understand how Intake can "take the pain out of data access", whether for the data user, curators, developers or IT professionals.

Importantly for this discussion, Intake provides the same API for accessing catalogues and data sources regardless of the backend, and provides metadata in a consistent way. The more types of data we can access and catalogue, the richer the Intake landscape!

Although a number of "drivers" - classes specialising in loading a particular type of data - are included in the main intake package, many more are provided in optional extra packages. Some of these are owned by the Intake organisation, but many are contributed by the community. A brief glimpse at https://intake.readthedocs.io/en/latest/plugin-directory.html show the currently known list, for those drivers that have specifically asked to be included in the list. Some of these packages contain more than one driver. This list has dynamically grown over time, and with the addition of a few more over the past months, the list has become too long to easily parse, and no longer a one-pager. This can only be good news for the Intake ecosystem!

New drivers

Without further ado, here is a list of new drivers that have appeared recently, with a brief description of each. Most of these have made it onto the list below, and some are just too new for someone to have got around to including them! We include direct links here for the ones that are not yet on the official list.

Intake

Summary

Intake is alive and kicking! Writing drivers is easy, and Intake is proving itself as being a great standardisation layer, to expose heterogeneous data collections to data practitioners.

The future

Look out for upcoming articles on intake-streamz and improvements to streamz in general!

Look out for detailed description and example of Intakes new transformation classes!

Our friends at Brookhaven are developing a scalable, performant Intake server to replace the current prototype included in Intake.