blt/bytelatent at 2cae41fe1f4a1cfa70b396222df52428b1bcc271 - vikarti.anatra/blt

mirror of https://github.com/facebookresearch/blt.git synced 2025-09-01 18:19:06 +00:00

History

Pedro Rodriguez 2cae41fe1f Some checks failed Lint with Black / lint (push) Has been cancelled Details Lint with isort / lint (push) Has been cancelled Details Get evals working again. - PPL/validation: Works now and uses multi-gpu. For some reason 1 GPU differs from multi-GPU, can debug in a followup PR - Generation evals likely work, but are very slow, so disabled for now Test Plan: ``` torchrun --nproc-per-node 8 -m bytelatent.eval config=../internal-blt/configs/eval.yaml ```		2025-02-28 00:41:01 +00:00
..
configs	Remove byte tokenizer and add config args to switch between byte/patch packing (#68 )	2025-02-25 11:10:59 -08:00
data	Pass mask in packing_iterator, correctly handle last batch, fix masking (#65 )	2025-02-27 11:41:47 -08:00
model	Add vocab and seq len abstract fields (#66 )	2025-02-24 14:41:58 -08:00
plotting	Add plotting code from paper (#17 )	2025-01-09 12:11:50 -08:00
preprocess	Allow ArrowIterator to read from json (#45 )	2025-02-06 09:57:22 -08:00
tokenizers	Remove byte tokenizer and add config args to switch between byte/patch packing (#68 )	2025-02-25 11:10:59 -08:00
.DS_Store	Initial commit	2024-12-12 15:32:30 -08:00
__init__.py	Initial commit	2024-12-12 15:32:30 -08:00
args.py	Get evals working again.	2025-02-28 00:41:01 +00:00
base_transformer.py	Initialize rope embeddings properly for the entropy model (#72 )	2025-02-25 15:35:25 -08:00
checkpoint.py	Update checkpointing to use fsspec (#39 )	2025-02-06 09:41:58 -08:00
config_parser.py	Make it possible to specify multiple config files (#54 )	2025-02-18 10:42:44 -08:00
constants.py	Initial commit	2024-12-12 15:32:30 -08:00
distributed.py	Get evals working again.	2025-02-28 00:41:01 +00:00
entropy_model.py	Changes for training entropy model and correcting attention in local models (#25 )	2025-01-17 14:23:01 -08:00
eval.py	Get evals working again.	2025-02-28 00:41:01 +00:00
float8.py	Initial commit	2024-12-12 15:32:30 -08:00
generate.py	Get evals working again.	2025-02-28 00:41:01 +00:00
logger.py	Update checkpointing to use fsspec (#39 )	2025-02-06 09:41:58 -08:00
metrics.py	Get evals working again.	2025-02-28 00:41:01 +00:00
norms.py	Fix distributed all reduce grad norm (#40 )	2025-02-04 16:53:50 -08:00
optim.py	Initial commit	2024-12-12 15:32:30 -08:00
print_config.py	Make it possible to specify multiple config files (#54 )	2025-02-18 10:42:44 -08:00
probe.py	Initial commit	2024-12-12 15:32:30 -08:00
profiling.py	Initial commit	2024-12-12 15:32:30 -08:00
stool.py	Allow ArrowIterator to read from json (#45 )	2025-02-06 09:57:22 -08:00
test_blt.py	Initial codes and scripts for training entropy model (#34 )	2025-01-27 09:46:44 -08:00
test_config_parser.py	Make it possible to specify multiple config files (#54 )	2025-02-18 10:42:44 -08:00
test_entropy_model.py	Test first batch matches (#53 )	2025-02-13 10:05:08 -08:00
train.py	Get evals working again.	2025-02-28 00:41:01 +00:00
transformer.py	Initialize rope embeddings properly for the entropy model (#72 )	2025-02-25 15:35:25 -08:00