databooks is a package that eases collaboration between data scientists using
Jupyter notebooks, by reducing the number of git conflicts between
different notebooks and helping to resolve git conflicts when they occur.
The key features include:
CLI tool
Clear notebook metadata
Resolve git conflicts
Simple to use
Simple API for modelling and comparing notebooks using Pydantic
Specify the paths of the notebook files whose metadata should be removed. Doing so
already avoids many of the conflicts.
$ databooks meta [OPTIONS] PATHS...
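To illustrate why clearing metadata avoids conflicts, here is a minimal sketch using only the Python standard library. It is not databooks' implementation: the function name `clear_metadata` is hypothetical, and the choice of fields to reset (notebook metadata, cell metadata and execution counts) is an assumption about what typically changes between runs.

```python
import copy


def clear_metadata(notebook: dict) -> dict:
    """Return a copy of a notebook dict with volatile fields reset.

    Hypothetical helper for illustration; the fields cleared here are
    an assumption, not a description of what databooks removes.
    """
    cleaned = copy.deepcopy(notebook)  # leave the caller's notebook untouched
    cleaned["metadata"] = {}
    for cell in cleaned.get("cells", []):
        cell["metadata"] = {}
        if cell.get("cell_type") == "code":
            # Execution counts change on every run and are a common
            # source of spurious conflicts.
            cell["execution_count"] = None
    return cleaned


example = {
    "nbformat": 4,
    "nbformat_minor": 5,
    "metadata": {"kernelspec": {"name": "python3"}},
    "cells": [
        {
            "cell_type": "code",
            "execution_count": 7,
            "metadata": {"collapsed": False},
            "source": ["print('hi')"],
            "outputs": [],
        }
    ],
}
cleaned = clear_metadata(example)
```

Two collaborators who run the same cells in a different order would otherwise produce different execution counts, and git would report a conflict even though the code is identical.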
Fix git conflicts for notebooks
Specify the paths of the notebook files with conflicts to be fixed. databooks then finds
the source notebooks that caused the conflicts and compares them (so no JSON manipulation!).
$ databooks fix [OPTIONS] PATHS...
Assert notebook metadata
Specify the paths of the notebooks to be checked and an expression or recipe of what you'd
like to enforce. databooks will run your checks and raise errors if any notebook does not
comply with the desired metadata values. This advanced feature allows users to enforce
cell tags, sequential cell execution, a maximum number of cells, among many other things!
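The kinds of checks mentioned above can be sketched in plain Python. This is only an illustration of the idea, not the expression or recipe syntax databooks uses: the function names are hypothetical, and "sequential execution" is assumed to mean that executed code cells are numbered 1, 2, 3, ... in order.

```python
def is_sequentially_executed(notebook: dict) -> bool:
    """Check that executed code cells are numbered 1..n in document order."""
    counts = [
        cell.get("execution_count")
        for cell in notebook.get("cells", [])
        if cell.get("cell_type") == "code"
        and cell.get("execution_count") is not None  # skip cells never run
    ]
    return counts == list(range(1, len(counts) + 1))


def has_at_most(notebook: dict, max_cells: int) -> bool:
    """Check that the notebook does not exceed a maximum number of cells."""
    return len(notebook.get("cells", [])) <= max_cells


in_order = {
    "cells": [
        {"cell_type": "markdown", "source": ["# Title"]},
        {"cell_type": "code", "execution_count": 1, "source": ["x = 1"]},
        {"cell_type": "code", "execution_count": 2, "source": ["print(x)"]},
    ]
}
out_of_order = {
    "cells": [
        {"cell_type": "code", "execution_count": 2, "source": ["x = 1"]},
        {"cell_type": "code", "execution_count": 1, "source": ["print(x)"]},
    ]
}

checks = {
    "in_order": is_sequentially_executed(in_order),
    "out_of_order": is_sequentially_executed(out_of_order),
}
```

A check like this could run in CI so that notebooks executed out of order are flagged before they are merged.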
Instead of launching Jupyter and opening the browser to inspect notebooks, have a quick
look at them in the terminal. All you need is to specify the path(s) of the notebook(s).
$ databooks show [OPTIONS] PATHS...
Show rich notebook diffs
Similar to git diff, but for notebooks! Show a rich diff of the notebooks in the
terminal. It can compare the git index with the current working directory, or compare
branches or blobs.
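As a rough illustration of comparing notebooks at the cell level rather than as raw JSON, the sketch below uses the standard library's `difflib`. It is an assumption-laden stand-in, not how databooks computes or renders its diffs: the helper names are hypothetical, and each cell's source is treated as a single unit of comparison.

```python
import difflib


def cell_sources(notebook: dict) -> list:
    """Collect the source of every cell as one string per cell."""
    # nbformat stores source either as a string or as a list of strings;
    # "".join handles both.
    return ["".join(cell.get("source", [])) for cell in notebook.get("cells", [])]


def diff_notebooks(old: dict, new: dict) -> list:
    """Return a unified diff over cell sources (one entry per cell)."""
    return list(
        difflib.unified_diff(
            cell_sources(old),
            cell_sources(new),
            fromfile="old",
            tofile="new",
            lineterm="",
        )
    )


old_nb = {
    "cells": [
        {"cell_type": "code", "source": ["x = 1"]},
        {"cell_type": "code", "source": ["print(x)"]},
    ]
}
new_nb = {
    "cells": [
        {"cell_type": "code", "source": ["x = 2"]},
        {"cell_type": "code", "source": ["print(x)"]},
    ]
}

changes = diff_notebooks(old_nb, new_nb)
```

Comparing cell sources instead of the serialized file keeps metadata and JSON formatting noise out of the diff.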