.. |
configs
|
Changes for training entropy model and correcting attention in local models (#25)
|
2025-01-17 14:23:01 -08:00 |
data
|
allow grads when calculating entropies
|
2025-01-18 01:42:00 -06:00 |
model
|
remove spammy warning
|
2025-01-17 18:04:37 -06:00 |
plotting
|
Add plotting code from paper (#17)
|
2025-01-09 12:11:50 -08:00 |
preprocess
|
Changes for training entropy model and correcting attention in local models (#25)
|
2025-01-17 14:23:01 -08:00 |
tokenizers
|
Initial commit
|
2024-12-12 15:32:30 -08:00 |
.DS_Store
|
Initial commit
|
2024-12-12 15:32:30 -08:00 |
__init__.py
|
Initial commit
|
2024-12-12 15:32:30 -08:00 |
args.py
|
Changes for training entropy model and correcting attention in local models (#25)
|
2025-01-17 14:23:01 -08:00 |
base_transformer.py
|
Changes for training entropy model and correcting attention in local models (#25)
|
2025-01-17 14:23:01 -08:00 |
checkpoint.py
|
Initial commit
|
2024-12-12 15:32:30 -08:00 |
constants.py
|
Initial commit
|
2024-12-12 15:32:30 -08:00 |
distributed.py
|
Changes for training entropy model and correcting attention in local models (#25)
|
2025-01-17 14:23:01 -08:00 |
entropy_model.py
|
Changes for training entropy model and correcting attention in local models (#25)
|
2025-01-17 14:23:01 -08:00 |
float8.py
|
Initial commit
|
2024-12-12 15:32:30 -08:00 |
logger.py
|
Replace regular filesystem calls with fsspec + add s3 support (#18)
|
2025-01-10 11:04:41 -08:00 |
metrics.py
|
Initial commit
|
2024-12-12 15:32:30 -08:00 |
optim.py
|
Initial commit
|
2024-12-12 15:32:30 -08:00 |
probe.py
|
Initial commit
|
2024-12-12 15:32:30 -08:00 |
profiling.py
|
Initial commit
|
2024-12-12 15:32:30 -08:00 |
stool.py
|
Initial commit
|
2024-12-12 15:32:30 -08:00 |
test_blt.py
|
Changes for training entropy model and correcting attention in local models (#25)
|
2025-01-17 14:23:01 -08:00 |
test_entropy_model.py
|
Changes for training entropy model and correcting attention in local models (#25)
|
2025-01-17 14:23:01 -08:00 |
train.py
|
Changes for training entropy model and correcting attention in local models (#25)
|
2025-01-17 14:23:01 -08:00 |
transformer.py
|
Changes for training entropy model and correcting attention in local models (#25)
|
2025-01-17 14:23:01 -08:00 |