blt/bytelatent
Pedro Rodriguez a1d05403b4 Replace regular filesystem calls with fsspec + add s3 support
Summary:

To support datasets on either local/NFS or S3 storage, swap regular filesystem calls for fsspec.

Add a tool to compare local and remote filesystems.
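
As a rough illustration of the two points above, the sketch below resolves a filesystem from a path with fsspec and then diffs two directory trees by relative path and size. It is not the code from this commit: the helper names `open_fs`, `list_sizes`, and `compare_trees` are hypothetical, and it assumes each filesystem's `find()` returns paths that begin with the root it was given. Only `fsspec.core.url_to_fs` and `AbstractFileSystem.find(path, detail=True)` are real fsspec calls; S3 paths additionally need the `s3fs` package installed.

```python
def open_fs(path):
    """Return (filesystem, stripped_path) for a local path or a URL such as s3://bucket/prefix."""
    # Imported lazily so the comparison helpers below have no hard dependency on fsspec.
    from fsspec.core import url_to_fs

    return url_to_fs(path)


def list_sizes(fs, root):
    """Map each file's path relative to `root` to its size in bytes."""
    root = root.rstrip("/")
    sizes = {}
    # find(..., detail=True) returns {full_path: info_dict}; info_dict carries a "size" key.
    for full_path, info in fs.find(root, detail=True).items():
        # Assumption: full_path starts with root, so slicing yields the relative path.
        relative = full_path[len(root):].lstrip("/")
        sizes[relative] = info["size"]
    return sizes


def compare_trees(fs_a, root_a, fs_b, root_b):
    """Report files missing on either side and files whose sizes differ."""
    a = list_sizes(fs_a, root_a)
    b = list_sizes(fs_b, root_b)
    return {
        "only_in_a": sorted(a.keys() - b.keys()),
        "only_in_b": sorted(b.keys() - a.keys()),
        "size_mismatch": sorted(k for k in a.keys() & b.keys() if a[k] != b[k]),
    }
```

A call might look like `compare_trees(*open_fs("/data/shards"), *open_fs("s3://my-bucket/shards"))`, where both paths are placeholders. Comparing sizes only is a deliberate simplification; it is cheap on object stores but will not catch same-size content differences.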

Test Plan:

- Ran the regular train script
- Ran with a config whose data is in S3
2025-01-10 01:04:18 +00:00
configs Initial commit 2024-12-12 15:32:30 -08:00
data Replace regular filesystem calls with fsspec + add s3 support 2025-01-10 01:04:18 +00:00
model Initial commit 2024-12-12 15:32:30 -08:00
plotting Add plotting code from paper (#17) 2025-01-09 12:11:50 -08:00
preprocess Initial commit 2024-12-12 15:32:30 -08:00
tokenizers Initial commit 2024-12-12 15:32:30 -08:00
.DS_Store Initial commit 2024-12-12 15:32:30 -08:00
__init__.py Initial commit 2024-12-12 15:32:30 -08:00
args.py Replace regular filesystem calls with fsspec + add s3 support 2025-01-10 01:04:18 +00:00
base_transformer.py Initial commit 2024-12-12 15:32:30 -08:00
checkpoint.py Initial commit 2024-12-12 15:32:30 -08:00
constants.py Initial commit 2024-12-12 15:32:30 -08:00
distributed.py Initial commit 2024-12-12 15:32:30 -08:00
entropy_model.py Initial commit 2024-12-12 15:32:30 -08:00
float8.py Initial commit 2024-12-12 15:32:30 -08:00
logger.py Replace regular filesystem calls with fsspec + add s3 support 2025-01-10 01:04:18 +00:00
metrics.py Initial commit 2024-12-12 15:32:30 -08:00
optim.py Initial commit 2024-12-12 15:32:30 -08:00
probe.py Initial commit 2024-12-12 15:32:30 -08:00
profiling.py Initial commit 2024-12-12 15:32:30 -08:00
stool.py Initial commit 2024-12-12 15:32:30 -08:00
test_blt.py Initial commit 2024-12-12 15:32:30 -08:00
test_entropy_model.py Initial commit 2024-12-12 15:32:30 -08:00
train.py Initial commit 2024-12-12 15:32:30 -08:00
transformer.py Initial commit 2024-12-12 15:32:30 -08:00