mirror of
https://github.com/facebookresearch/blt.git
synced 2025-02-23 13:32:14 +00:00
Summary: - Make arrow iterator able to read from jsonl files, the entropies are omitted in this case - Make the data/checkpoint code fsspec compatible - Fix issues with all reduce with non-bf16 in dist_sum and norm computation. - Minimal fixes to get eval to run, it is slow currently - Add bpb numbers during training Test Plan: Run ``` torchrun --nproc-per-node 8 -m bytelatent.train config=internal/configs/entropy_model.yaml eval=null max_steps=10100 ``` ``` python -m bytelatent.train config=internal/configs/s3_debug.yaml eval=null ``` ``` torchrun --nproc-per-node 8 -m bytelatent.train config=internal/configs/s3_debug.yaml eval=null ``` |
||
---|---|---|
.. | ||
__init__.py | ||
abstract_iterator.py | ||
arrow_iterator.py | ||
looping_iterator.py | ||
multiprocess_iterator.py | ||
packing_iterator.py | ||
preprocess_iterator.py | ||
sampling_iterator.py | ||
sequence_iterator.py | ||
test_arrow_iterator.py | ||
test_iters.py |