Data Provenance

Sep 04, 2017, 23 min

Episode description

Software engineers are familiar with the idea of versioning code, so you can go back later and revive a past state of the system.  For data scientists who might want to reconstruct past models, though, it's not just about keeping the modeling code.  It's also about saving a version of the data that made the model.  There are a lot of other benefits to keeping track of datasets, so in this episode we'll talk about data lineage or data provenance.
Data Provenance | Linear Digressions podcast