..
configs
Remove byte tokenizer and add config args to switch between byte/patch packing ( #68 )
2025-02-25 11:10:59 -08:00
data
Get generation working for BLT ( #86 )
2025-04-01 16:07:55 -07:00
model
Fix in-place addition of patch_embds ( #85 )
2025-03-20 16:46:32 -07:00
plotting
Add plotting code from paper ( #17 )
2025-01-09 12:11:50 -08:00
preprocess
Allow ArrowIterator to read from json ( #45 )
2025-02-06 09:57:22 -08:00
tokenizers
Remove byte tokenizer and add config args to switch between byte/patch packing ( #68 )
2025-02-25 11:10:59 -08:00
.DS_Store
Initial commit
2024-12-12 15:32:30 -08:00
__init__.py
Initial commit
2024-12-12 15:32:30 -08:00
args.py
Get generation working for BLT ( #86 )
2025-04-01 16:07:55 -07:00
base_transformer.py
Initialize rope embeddings properly for the entropy model ( #72 )
2025-02-25 15:35:25 -08:00
checkpoint.py
Add way to call consolidate ( #80 )
2025-03-11 16:53:33 -07:00
config_parser.py
When merging configs, do not merge data sources ( #79 )
2025-03-11 11:03:24 -07:00
constants.py
Initial commit
2024-12-12 15:32:30 -08:00
distributed.py
Get generation working for BLT ( #86 )
2025-04-01 16:07:55 -07:00
entropy_model.py
Some fixes for entropy model predictions ( #83 )
2025-03-13 10:28:42 -07:00
eval.py
Get generation working for BLT ( #86 )
2025-04-01 16:07:55 -07:00
float8.py
Initial commit
2024-12-12 15:32:30 -08:00
generate.py
Get generation working for BLT ( #86 )
2025-04-01 16:07:55 -07:00
generate_blt.py
Get generation working for BLT ( #86 )
2025-04-01 16:07:55 -07:00
iterate_data.py
Update iterate_data ( #81 )
2025-03-13 10:14:41 -07:00
logger.py
Update checkpointing to use fsspec ( #39 )
2025-02-06 09:41:58 -08:00
metrics.py
Get evals working again. ( #46 )
2025-03-11 09:57:19 -07:00
norms.py
Fix distributed all reduce grad norm ( #40 )
2025-02-04 16:53:50 -08:00
optim.py
Initial commit
2024-12-12 15:32:30 -08:00
print_config.py
Make it possible to specify multiple config files ( #54 )
2025-02-18 10:42:44 -08:00
probe.py
Initial commit
2024-12-12 15:32:30 -08:00
profiling.py
Initial commit
2024-12-12 15:32:30 -08:00
stool.py
Fix rsync to not preserve original permissions, instead use destination ( #76 )
2025-03-05 11:49:41 -08:00
test_blt.py
Initial codes and scripts for training entropy model ( #34 )
2025-01-27 09:46:44 -08:00
test_config_parser.py
Make it possible to specify multiple config files ( #54 )
2025-02-18 10:42:44 -08:00
test_entropy_model.py
Test first batch matches ( #53 )
2025-02-13 10:05:08 -08:00
train.py
Get evals working again. ( #46 )
2025-03-11 09:57:19 -07:00
transformer.py
Initialize rope embeddings properly for the entropy model ( #72 )
2025-02-25 15:35:25 -08:00