blt/bytelatent
2025-04-09 00:21:48 +00:00
..
configs Remove byte tokenizer and add config args to switch between byte/patch packing (#68) 2025-02-25 11:10:59 -08:00
data Cast int sample id to str 2025-04-09 00:21:48 +00:00
model Fix in-place addition of patch_embds (#85) 2025-03-20 16:46:32 -07:00
plotting Add plotting code from paper (#17) 2025-01-09 12:11:50 -08:00
preprocess Allow ArrowIterator to read from json (#45) 2025-02-06 09:57:22 -08:00
tokenizers Remove byte tokenizer and add config args to switch between byte/patch packing (#68) 2025-02-25 11:10:59 -08:00
.DS_Store Initial commit 2024-12-12 15:32:30 -08:00
__init__.py Initial commit 2024-12-12 15:32:30 -08:00
args.py Fix eval mask (#93) 2025-04-08 10:31:40 -07:00
base_transformer.py Initialize rope embeddings properly for the entropy model (#72) 2025-02-25 15:35:25 -08:00
checkpoint.py Add way to call consolidate (#80) 2025-03-11 16:53:33 -07:00
config_parser.py When merging configs, do not merge data sources (#79) 2025-03-11 11:03:24 -07:00
constants.py Initial commit 2024-12-12 15:32:30 -08:00
distributed.py Init distributed when loading model (#94) 2025-04-08 13:57:28 -07:00
entropy_model.py Some fixes for entropy model predictions (#83) 2025-03-13 10:28:42 -07:00
eval.py Fix eval mask (#93) 2025-04-08 10:31:40 -07:00
float8.py Initial commit 2024-12-12 15:32:30 -08:00
generate.py Init distributed when loading model (#94) 2025-04-08 13:57:28 -07:00
generate_blt.py Get generation working for BLT (#86) 2025-04-01 16:07:55 -07:00
iterate_data.py Update iterate_data (#81) 2025-03-13 10:14:41 -07:00
logger.py Update checkpointing to use fsspec (#39) 2025-02-06 09:41:58 -08:00
metrics.py Get evals working again. (#46) 2025-03-11 09:57:19 -07:00
norms.py Fix distributed all reduce grad norm (#40) 2025-02-04 16:53:50 -08:00
optim.py Initial commit 2024-12-12 15:32:30 -08:00
print_config.py Make it possible to specify multiple config files (#54) 2025-02-18 10:42:44 -08:00
probe.py Initial commit 2024-12-12 15:32:30 -08:00
profiling.py Initial commit 2024-12-12 15:32:30 -08:00
stool.py Fix rsync to not preserve original permissions, instead use destination (#76) 2025-03-05 11:49:41 -08:00
test_blt.py Initial codes and scripts for training entropy model (#34) 2025-01-27 09:46:44 -08:00
test_config_parser.py Make it possible to specify multiple config files (#54) 2025-02-18 10:42:44 -08:00
test_entropy_model.py Test first batch matches (#53) 2025-02-13 10:05:08 -08:00
train.py Get evals working again. (#46) 2025-03-11 09:57:19 -07:00
transformer.py Initialize rope embeddings properly for the entropy model (#72) 2025-02-25 15:35:25 -08:00