..
configs
[WIP] Changes for training entropy model and correcting attention in local models
2025-01-16 21:51:05 +00:00
data
[WIP] Changes for training entropy model and correcting attention in local models
2025-01-16 21:51:05 +00:00
model
[WIP] Changes for training entropy model and correcting attention in local models
2025-01-16 21:51:05 +00:00
plotting
Add plotting code from paper ( #17 )
2025-01-09 12:11:50 -08:00
preprocess
[WIP] Changes for training entropy model and correcting attention in local models
2025-01-16 21:51:05 +00:00
tokenizers
Initial commit
2024-12-12 15:32:30 -08:00
.DS_Store
Initial commit
2024-12-12 15:32:30 -08:00
__init__.py
Initial commit
2024-12-12 15:32:30 -08:00
args.py
[WIP] Changes for training entropy model and correcting attention in local models
2025-01-16 21:51:05 +00:00
base_transformer.py
[WIP] Changes for training entropy model and correcting attention in local models
2025-01-16 21:51:05 +00:00
checkpoint.py
Initial commit
2024-12-12 15:32:30 -08:00
constants.py
Initial commit
2024-12-12 15:32:30 -08:00
distributed.py
allow flex-attention to be disabled ( #19 )
2025-01-14 09:32:07 -08:00
entropy_model.py
Initial commit
2024-12-12 15:32:30 -08:00
float8.py
Initial commit
2024-12-12 15:32:30 -08:00
logger.py
Replace regular filesystem calls with fsspec + add s3 support ( #18 )
2025-01-10 11:04:41 -08:00
metrics.py
Initial commit
2024-12-12 15:32:30 -08:00
optim.py
Initial commit
2024-12-12 15:32:30 -08:00
probe.py
Initial commit
2024-12-12 15:32:30 -08:00
profiling.py
Initial commit
2024-12-12 15:32:30 -08:00
stool.py
Initial commit
2024-12-12 15:32:30 -08:00
test_blt.py
[WIP] Changes for training entropy model and correcting attention in local models
2025-01-16 21:51:05 +00:00
test_entropy_model.py
[WIP] Changes for training entropy model and correcting attention in local models
2025-01-16 21:51:05 +00:00
train.py
Initial commit
2024-12-12 15:32:30 -08:00
transformer.py
[WIP] Changes for training entropy model and correcting attention in local models
2025-01-16 21:51:05 +00:00