Datalad

Latest version: v1.1.5

Safety actively analyzes 723177 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 15 of 16

0.5.0

Not secure
This release includes an avalanche of bug fixes, enhancements, and
additions which at large should stay consistent with previous behavior
but provide better functioning. Lots of code was refactored to provide
more consistent code-base, and some API breakage has happened. Further
work is ongoing to standardize output and results reporting
([1350][])

Most notable changes

- requires [git-annex][] >= 6.20161210 (or better even >= 6.20161210 for
improved functionality)
- commands should now operate on paths specified (if any), without
causing side-effects on other dirty/staged files
- [save][]
- `-a` is deprecated in favor of `-u` or `--all-updates`
so only changes known components get saved, and no new files
automagically added
- `-S` does no longer store the originating dataset in its commit
message
- [add][]
- can specify commit/save message with `-m`
- [add-sibling][] and [create-sibling][]
- now take the name of the sibling (remote) as a `-s` (`--name`)
option, not a positional argument
- `--publish-depends` to setup publishing data and code to multiple
repositories (e.g. github + webserve) should now be functional
see [this comment](https://github.com/datalad/datalad/issues/335#issuecomment-277240733)
- got `--publish-by-default` to specify what refs should be published
by default
- got `--annex-wanted`, `--annex-groupwanted` and `--annex-group`
settings which would be used to instruct annex about preferred
content. [publish][] then will publish data using those settings if
`wanted` is set.
- got `--inherit` option to automagically figure out url/wanted and
other git/annex settings for new remote sub-dataset to be constructed
- [publish][]
- got `--skip-failing` refactored into `--missing` option
which could use new feature of [create-sibling][] `--inherit`

Fixes

- More consistent interaction through ssh - all ssh connections go
through [sshrun][] shim for a "single point of authentication", etc.
- More robust [ls][] operation outside of the datasets
- A number of fixes for direct and v6 mode of annex

Enhancements and new features

- New [drop][] and [remove][] commands
- [clean][]
- got `--what` to specify explicitly what cleaning steps to perform
and now could be invoked with `-r`
- `datalad` and `git-annex-remote*` scripts now do not use setuptools
entry points mechanism and rely on simple import to shorten start up time
- [Dataset][] is also now using [Flyweight pattern][], so the same instance is
reused for the same dataset
- progressbars should not add more empty lines

Internal refactoring

- Majority of the commands now go through `_prep` for arguments validation
and pre-processing to avoid recursive invocations

0.4.1

Not secure
Requires now GitPython >= 2.1.0

Fixes

- [save][]
- to not save staged files if explicit paths were provided
- improved (but not yet complete) support for direct mode
- [update][] to not crash if some sub-datasets are not installed
- do not log calls to `git config` to avoid leakage of possibly
sensitive settings to the logs

Enhancements and new features

- New [rfc822-compliant metadata][] format
- [save][]
- -S to save the change also within all super-datasets
- [add][] now has progress-bar reporting
- [create-sibling-github][] to create a :term:`sibling` of a dataset on
github
- [OpenfMRI][] crawler and datasets were enriched with URLs to separate
files where also available from openfmri s3 bucket
(if upgrading your datalad datasets, you might need to run
`git annex enableremote datalad` to make them available)
- various enhancements to log messages
- web interface
- populates "install" box first thus making UX better over slower
connections

0.4

Not secure
Primarily it is a bugfix release but because of significant refactoring
of the [install][] and [get][] implementation, it gets a new minor release.

Fixes

- be able to [get][] or [install][] while providing paths while being
outside of a dataset
- remote annex datasets get properly initialized
- robust detection of outdated [git-annex][]

Enhancements and new features

- interface changes
- [get][] `--recursion-limit=existing` to not recurse into not-installed
subdatasets
- [get][] `-n` to possibly install sub-datasets without getting any data
- [install][] `--jobs|-J` to specify number of parallel jobs for annex
[get][] call could use (ATM would not work when data comes from archives)
- more (unit-)testing
- documentation: see http://docs.datalad.org/en/latest/basics.html
for basic principles and useful shortcuts in referring to datasets
- various webface improvements: breadcrumb paths, instructions how
to install dataset, show version from the tags, etc.

0.3.1

Not secure
Primarily bugfixes but also a number of enhancements and core
refactorings

Fixes

- do not build manpages and examples during installation to avoid
problems with possibly previously outdated dependencies
- [install][] can be called on already installed dataset (with `-r` or
`-g`)

Enhancements and new features

- complete overhaul of datalad configuration settings handling
(see [Configuration documentation][]), so majority of the environment.
Now uses git format and stores persistent configuration settings under
`.datalad/config` and local within `.git/config`
variables we have used were renamed to match configuration names
- [create-sibling][] does not now by default upload web front-end
- [export][] command with a plug-in interface and `tarball` plugin to export
datasets
- in Python, `.api` functions with rendering of results in command line
got a _-suffixed sibling, which would render results as well in Python
as well (e.g., using `search_` instead of `search` would also render
results, not only output them back as Python objects)
- [get][]
- `--jobs` option (passed to `annex get`) for parallel downloads
- total and per-download (with git-annex >= 6.20160923) progress bars
(note that if content is to be obtained from an archive, no progress
will be reported yet)
- [install][] `--reckless` mode option
- [search][]
- highlights locations and fieldmaps for better readability
- supports `-d^` or `-d///` to point to top-most or centrally
installed meta-datasets
- "complete" paths to the datasets are reported now
- `-s` option to specify which fields (only) to search
- various enhancements and small fixes to [meta-data][] handling, [ls][],
custom remotes, code-base formatting, downloaders, etc
- completely switched to `tqdm` library (`progressbar` is no longer
used/supported)

0.3

Not secure
Lots of everything, including but not limited to

- enhanced index viewer, as the one on http://datasets.datalad.org
- initial new data providers support: [Kaggle][], [BALSA][], [NDA][], [NITRC][]
- initial [meta-data support and management][]
- new and/or improved crawler pipelines for [BALSA][], [CRCNS][], [OpenfMRI][]
- refactored [install][] command, now with separate [get][]
- some other commands renaming/refactoring (e.g., [create-sibling][])
- datalad [search][] would give you an option to install datalad's
super-dataset under ~/datalad if ran outside of a dataset

0.2.3

Not secure
New features and bugfix release

- support of /// urls to point to http://datasets.datalad.org
- variety of fixes and enhancements throughout

Page 15 of 16

© 2025 Safety CLI Cybersecurity Inc. All Rights Reserved.