Tuesday, October 20 • 13:00 - 13:30
Revision Control for Structured Data

Revision control for source code, and especially Git, has driven a great leap forward in software development and delivery. A similar revolution has not yet taken place in data. This talk will discuss the various open source databases that are approaching this problem, their underlying architectures, and the challenges in building both a 'Git for data' and a 'GitHub for data'.

It will posit that a truly collaborative, distributed system must be:

1) decentralized
2) offline-first: work offline and then resync when online again
3) reliable: conflicts are handled properly
4) private: end-to-end encrypted, if desired
5) efficient: only changes (diffs) to the data set are transmitted between participants
6) collaborative: multiple people can work on the same data set
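
As a rough illustration of requirement 5, the sketch below computes the changes between two versions of a small data set and applies them elsewhere. It is only a minimal sketch under stated assumptions: the data set is modelled as a dictionary of records keyed by id, and the names `diff` and `apply_patch` are hypothetical, not taken from TerminusDB or any other database mentioned in the talk.

```python
from typing import Any, Dict

Record = Dict[str, Any]
DataSet = Dict[str, Record]  # assumed model: record id -> record


def diff(old: DataSet, new: DataSet) -> dict:
    """Return only what changed between two versions of a data set."""
    return {
        # records present in `new` but not in `old`
        "added": {k: new[k] for k in new.keys() - old.keys()},
        # ids present in `old` but missing from `new`
        "removed": sorted(old.keys() - new.keys()),
        # records present in both versions whose contents differ
        "changed": {
            k: new[k] for k in old.keys() & new.keys() if old[k] != new[k]
        },
    }


def apply_patch(base: DataSet, patch: dict) -> DataSet:
    """Rebuild the newer version from `base` plus a patch made by `diff`."""
    result = {k: v for k, v in base.items() if k not in patch["removed"]}
    result.update(patch["added"])
    result.update(patch["changed"])
    return result


v1 = {"a": {"n": 1}, "b": {"n": 2}}
v2 = {"a": {"n": 1}, "b": {"n": 3}, "c": {"n": 4}}

patch = diff(v1, v2)              # only "b" and "c" appear in the patch
restored = apply_patch(v1, patch)  # equal to v2
```

Only the patch would need to travel between participants; the unchanged record `a` is never retransmitted. A real system would also need to detect and resolve conflicting patches (requirement 3), which this sketch does not attempt.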

Many applications choose the SaaS route: one central database behind a web service, with every frontend displaying an instantaneous view of some part of the data set. This breaks most of the requirements above.
A database-as-a-service approach with an MVCC database and the flexibility to version schemas is a prerequisite for success. Finally, the talk will look to the future and the dawn of CI/CD for data.

Dr. Gavin Mendel-Gleason

CTO, TerminusDB
Dr. Gavin Mendel-Gleason is CTO of TerminusDB. He is a former research fellow at Trinity College Dublin in the School of Computer Science and Statistics. His research focuses on databases, logic and verification in software engineering. His work includes contributing to the Seshat...

Alick Smith

Manager, Professional Services, Percona

Tuesday October 20, 2020 13:00 - 13:30 EDT
  Other OSDB