blt/bytelatent
Srinivasan Iyer 48e4ad0bd2
make sure max_encoder_seq_length matches ()
* make sure max_encoder_seq_length matches

* black and assert comment

---------

Co-authored-by: Srini Iyer <sviyer@meta.com>
2025-02-12 18:27:22 -08:00
..
configs This includes fixes that make checkpointing and reloading work correctly. () 2025-01-27 16:56:42 -08:00
data Allow ArrowIterator to read from json () 2025-02-06 09:57:22 -08:00
model fix save and reload model state () 2025-02-07 14:27:47 -08:00
plotting Add plotting code from paper () 2025-01-09 12:11:50 -08:00
preprocess Allow ArrowIterator to read from json () 2025-02-06 09:57:22 -08:00
tokenizers Initial commit 2024-12-12 15:32:30 -08:00
.DS_Store Initial commit 2024-12-12 15:32:30 -08:00
__init__.py Initial commit 2024-12-12 15:32:30 -08:00
args.py Allow ArrowIterator to read from json () 2025-02-06 09:57:22 -08:00
base_transformer.py Fix init and repro () 2025-02-06 14:18:02 -08:00
checkpoint.py Update checkpointing to use fsspec () 2025-02-06 09:41:58 -08:00
constants.py Initial commit 2024-12-12 15:32:30 -08:00
distributed.py Add bpb and n_bytes to metric logging () 2025-02-07 13:14:30 -08:00
entropy_model.py Changes for training entropy model and correcting attention in local models () 2025-01-17 14:23:01 -08:00
eval.py This includes fixes that make checkpointing and reloading work correctly. () 2025-01-27 16:56:42 -08:00
float8.py Initial commit 2024-12-12 15:32:30 -08:00
generate.py This includes fixes that make checkpointing and reloading work correctly. () 2025-01-27 16:56:42 -08:00
logger.py Update checkpointing to use fsspec () 2025-02-06 09:41:58 -08:00
metrics.py Add bpb and n_bytes to metric logging () 2025-02-07 13:14:30 -08:00
norms.py Fix distributed all reduce grad norm () 2025-02-04 16:53:50 -08:00
optim.py Initial commit 2024-12-12 15:32:30 -08:00
probe.py Initial commit 2024-12-12 15:32:30 -08:00
profiling.py Initial commit 2024-12-12 15:32:30 -08:00
stool.py Allow ArrowIterator to read from json () 2025-02-06 09:57:22 -08:00
test_blt.py Initial codes and scripts for training entropy model () 2025-01-27 09:46:44 -08:00
test_entropy_model.py Changes for training entropy model and correcting attention in local models () 2025-01-17 14:23:01 -08:00
train.py make sure max_encoder_seq_length matches () 2025-02-12 18:27:22 -08:00
transformer.py Fix init and repro () 2025-02-06 14:18:02 -08:00