.. |
configs
|
Remove byte tokenizer and add config args to switch between byte/patch packing (#68)
|
2025-02-25 11:10:59 -08:00 |
data
|
Reduce per file resources arrow uses (#77)
|
2025-03-05 15:03:42 -08:00 |
model
|
Add vocab and seq len abstract fields (#66)
|
2025-02-24 14:41:58 -08:00 |
plotting
|
Add plotting code from paper (#17)
|
2025-01-09 12:11:50 -08:00 |
preprocess
|
Allow ArrowIterator to read from json (#45)
|
2025-02-06 09:57:22 -08:00 |
tokenizers
|
Remove byte tokenizer and add config args to switch between byte/patch packing (#68)
|
2025-02-25 11:10:59 -08:00 |
.DS_Store
|
Initial commit
|
2024-12-12 15:32:30 -08:00 |
__init__.py
|
Initial commit
|
2024-12-12 15:32:30 -08:00 |
args.py
|
Get evals working again. (#46)
|
2025-03-11 09:57:19 -07:00 |
base_transformer.py
|
Initialize rope embeddings properly for the entropy model (#72)
|
2025-02-25 15:35:25 -08:00 |
checkpoint.py
|
isort
|
2025-03-11 23:53:01 +00:00 |
config_parser.py
|
When merging configs, do not merge data sources (#79)
|
2025-03-11 11:03:24 -07:00 |
constants.py
|
Initial commit
|
2024-12-12 15:32:30 -08:00 |
distributed.py
|
Get evals working again. (#46)
|
2025-03-11 09:57:19 -07:00 |
entropy_model.py
|
Changes for training entropy model and correcting attention in local models (#25)
|
2025-01-17 14:23:01 -08:00 |
eval.py
|
Get evals working again. (#46)
|
2025-03-11 09:57:19 -07:00 |
float8.py
|
Initial commit
|
2024-12-12 15:32:30 -08:00 |
generate.py
|
Get evals working again. (#46)
|
2025-03-11 09:57:19 -07:00 |
iterate_data.py
|
Reduce per file resources arrow uses (#77)
|
2025-03-05 15:03:42 -08:00 |
logger.py
|
Update checkpointing to use fsspec (#39)
|
2025-02-06 09:41:58 -08:00 |
metrics.py
|
Get evals working again. (#46)
|
2025-03-11 09:57:19 -07:00 |
norms.py
|
Fix distributed all reduce grad norm (#40)
|
2025-02-04 16:53:50 -08:00 |
optim.py
|
Initial commit
|
2024-12-12 15:32:30 -08:00 |
print_config.py
|
Make it possible to specify multiple config files (#54)
|
2025-02-18 10:42:44 -08:00 |
probe.py
|
Initial commit
|
2024-12-12 15:32:30 -08:00 |
profiling.py
|
Initial commit
|
2024-12-12 15:32:30 -08:00 |
stool.py
|
Fix rsync to not preserve original permissions, instead use destination (#76)
|
2025-03-05 11:49:41 -08:00 |
test_blt.py
|
Initial codes and scripts for training entropy model (#34)
|
2025-01-27 09:46:44 -08:00 |
test_config_parser.py
|
Make it possible to specify multiple config files (#54)
|
2025-02-18 10:42:44 -08:00 |
test_entropy_model.py
|
Test first batch matches (#53)
|
2025-02-13 10:05:08 -08:00 |
train.py
|
Get evals working again. (#46)
|
2025-03-11 09:57:19 -07:00 |
transformer.py
|
Initialize rope embeddings properly for the entropy model (#72)
|
2025-02-25 15:35:25 -08:00 |