Releases: tensorflow/datasets
v4.9.7.dev20241125160643
Added
- New datasets.
Changed
- `CroissantBuilder`'s API to generate TFDS datasets from Croissant files.
Deprecated
Removed
Fixed
- Versions for existing datasets.
Security
v4.9.7
Added
- New datasets.
Changed
- `CroissantBuilder`'s API to generate TFDS datasets from Croissant files.
Deprecated
Removed
Fixed
- Versions for existing datasets.
Security
v4.9.6
Added
- Full support for Python 3.12.
v4.9.5
Added
- Support to download and prepare datasets using the Parquet data format.

  ```python
  import tensorflow_datasets as tfds

  # Build fashion_mnist on disk as Parquet files, then read the train split.
  builder = tfds.builder('fashion_mnist', file_format='parquet')
  builder.download_and_prepare()
  ds = builder.as_dataset(split='train')
  print(next(iter(ds)))
  ```
- `tfds.data_source` is picklable, thus working smoothly with PyGrain. Learn more by following the tutorial.
- TFDS plays nicely with Croissant. Learn more by following the recipe.
Changed
Deprecated
Removed
Fixed
Security
v4.9.4
Added
- A new `CroissantBuilder` which initializes a `DatasetBuilder` based on a Croissant metadata file.
- New conversion options between different bounding box formats.
- Better support for `HuggingfaceDatasetBuilder`.
- A script to convert a dataset from one format to another.
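
To illustrate what a conversion between bounding box formats involves, here is a minimal sketch in plain Python. It is not the TFDS conversion API: the type and function names (`PixelBoxXYWH`, `NormalizedBoxYXYX`, `xywh_to_normalized_yxyx`, `normalized_yxyx_to_xywh`) are hypothetical and exist only for this example. It assumes one format stores pixel coordinates as (x, y, width, height) and the other stores coordinates normalized to [0, 1] as (ymin, xmin, ymax, xmax).

```python
from typing import NamedTuple


class PixelBoxXYWH(NamedTuple):
    """Hypothetical box in pixels: top-left corner plus width and height."""
    x: float
    y: float
    width: float
    height: float


class NormalizedBoxYXYX(NamedTuple):
    """Hypothetical box with corners normalized to [0, 1] by image size."""
    ymin: float
    xmin: float
    ymax: float
    xmax: float


def xywh_to_normalized_yxyx(
    box: PixelBoxXYWH, image_width: int, image_height: int
) -> NormalizedBoxYXYX:
    """Convert a pixel XYWH box to a normalized YXYX box."""
    return NormalizedBoxYXYX(
        ymin=box.y / image_height,
        xmin=box.x / image_width,
        ymax=(box.y + box.height) / image_height,
        xmax=(box.x + box.width) / image_width,
    )


def normalized_yxyx_to_xywh(
    box: NormalizedBoxYXYX, image_width: int, image_height: int
) -> PixelBoxXYWH:
    """Convert a normalized YXYX box back to a pixel XYWH box."""
    return PixelBoxXYWH(
        x=box.xmin * image_width,
        y=box.ymin * image_height,
        width=(box.xmax - box.xmin) * image_width,
        height=(box.ymax - box.ymin) * image_height,
    )


# A 30x40 pixel box at (10, 20) inside a 100x200 image.
pixel_box = PixelBoxXYWH(x=10, y=20, width=30, height=40)
normalized = xywh_to_normalized_yxyx(pixel_box, image_width=100, image_height=200)
round_trip = normalized_yxyx_to_xywh(normalized, image_width=100, image_height=200)
```

The two functions are inverses up to floating-point rounding, so a round trip is a convenient sanity check when mixing datasets that use different conventions.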
Changed
Deprecated
- Python 3.9 support. TFDS now uses Python 3.10.
Removed
Fixed
Security
v4.9.3
Added
- Segment Anything (SA-1B) dataset.
Changed
- Hugging Face datasets accept `None` values for any features. TFDS has no `tfds.features.Optional`, so `None` values are converted to default values. Those default values used to be `0` and `0.0` for int and float. Now, they are `-inf` as defined by NumPy, that is, the minimum value of the dtype (e.g., `np.iinfo(np.int32).min` or `np.finfo(np.float32).min`). This avoids ambiguous values when `0` and `0.0` exist in the values of the dataset. The roadmap is to implement `tfds.features.Optional`.
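
The effect of the new defaults can be sketched with NumPy alone. This is only an illustration of the idea, not TFDS's internal code: the helpers `default_for` and `fill_none` are hypothetical names introduced here, and the sketch assumes `None` entries are simply replaced by the dtype's minimum value.

```python
import numpy as np


def default_for(dtype):
    """Return the minimum representable value of an int or float dtype."""
    dtype = np.dtype(dtype)
    if np.issubdtype(dtype, np.integer):
        return np.iinfo(dtype).min
    if np.issubdtype(dtype, np.floating):
        return np.finfo(dtype).min
    raise TypeError(f"No default value defined for dtype {dtype}")


def fill_none(values, dtype):
    """Replace None entries with the dtype's minimum value (hypothetical helper)."""
    sentinel = default_for(dtype)
    return np.array([sentinel if v is None else v for v in values], dtype=dtype)


# A genuine 0 stays distinguishable from a missing value.
ints = fill_none([3, None, 0], np.int32)
floats = fill_none([0.0, None, 1.5], np.float32)

# Missing entries can be located again by comparing with the sentinel.
missing_ints = ints == default_for(np.int32)
```

With the previous defaults (`0` and `0.0`), the last element of `ints` and the first element of `floats` would have been indistinguishable from missing values.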
Deprecated
- Python 3.8 support. As per NEP 29, TFDS now uses Python>=3.9.
Removed
Fixed
Security
v4.9.2
Added
- [Experimental] A list of freeform text tags can now be attached to a `BuilderConfig`. For example:

  ```python
  BUILDER_CONFIGS = [
      tfds.core.BuilderConfig(name="foo", tags=["foo", "live"]),
      tfds.core.BuilderConfig(name="bar", tags=["bar", "old"]),
  ]
  ```

  The tags are recorded with the dataset metadata and can later be retrieved using the info object:

  ```python
  builder.info.config_tags  # ["foo", "live"]
  ```

  This feature is experimental and there are no guidelines on tags format.
Changed
Deprecated
Removed
Fixed
- Fixed generated proto files (see issue 4858).
Security
v4.9.1
Added
Changed
Deprecated
Removed
Fixed
- The installation on macOS now works (see issues 4805 and 4852). The ArrayRecord dependency is lazily loaded, so the TensorFlow-less path is not possible at the moment on macOS. A fix for this will follow soon.
Security
v4.9.0
Added
- Native support for JAX and PyTorch. TensorFlow is no longer a dependency for reading datasets. See the documentation.
- Added minival split to LVIS dataset.
- Mixed-human and machine-generated robomimic datasets.
- WebVid dataset.
- ImagenetPI dataset.
- Wikipedia for 20230201.
Changed
- Support for `tensorflow=2.12`.
Deprecated
Removed
Fixed
Security
v4.8.3
Added
Changed
Deprecated
- Python 3.7 support: this version and future versions use Python 3.8.
Removed
Fixed
- Flag `ignore_verifications` from Hugging Face's `datasets.load_dataset` is deprecated, and used to cause errors in `tfds.load(huggingface:foo)`.