blt/bytelatent/data/iterators
Pedro Rodriguez 84854423c4 Replace regular filesystem calls with fsspec + add s3 support
Summary:

For compatibility with either local/nfs or S3 datasets, swap to fsspec.

Add a tool to compare local and remote filesystems

Test Plan:

- Ran regular train script
- Ran with config with data in S3
2025-01-10 01:01:57 +00:00
..
__init__.py Initial commit 2024-12-12 15:32:30 -08:00
abstract_iterator.py Initial commit 2024-12-12 15:32:30 -08:00
arrow_iterator.py Replace regular filesystem calls with fsspec + add s3 support 2025-01-10 01:01:57 +00:00
looping_iterator.py Initial commit 2024-12-12 15:32:30 -08:00
multiprocess_iterator.py Initial commit 2024-12-12 15:32:30 -08:00
packing_iterator.py Initial commit 2024-12-12 15:32:30 -08:00
preprocess_iterator.py Initial commit 2024-12-12 15:32:30 -08:00
sampling_iterator.py Initial commit 2024-12-12 15:32:30 -08:00
sequence_iterator.py Initial commit 2024-12-12 15:32:30 -08:00
test_arrow_iterator.py Initial commit 2024-12-12 15:32:30 -08:00
test_iters.py Initial commit 2024-12-12 15:32:30 -08:00