blt/bytelatent
Pedro Rodriguez 7f305b3871
[WIP] Changes for training entropy model and correcting attention in local models
Summary:

- Refactor local model configs to be separate and clearer
- Add attention arguments and correct which attention is used in local models
- Prepare for adding an entropy model training script
- Fix failing unit tests

Test Plan:
2025-01-17 22:21:51 +00:00
configs [WIP] Changes for training entropy model and correcting attention in local models 2025-01-17 22:21:51 +00:00
data [WIP] Changes for training entropy model and correcting attention in local models 2025-01-17 22:21:51 +00:00
model [WIP] Changes for training entropy model and correcting attention in local models 2025-01-17 22:21:51 +00:00
plotting Add plotting code from paper (#17) 2025-01-09 12:11:50 -08:00
preprocess [WIP] Changes for training entropy model and correcting attention in local models 2025-01-17 22:21:51 +00:00
tokenizers Initial commit 2024-12-12 15:32:30 -08:00
.DS_Store Initial commit 2024-12-12 15:32:30 -08:00
__init__.py Initial commit 2024-12-12 15:32:30 -08:00
args.py [WIP] Changes for training entropy model and correcting attention in local models 2025-01-17 22:21:51 +00:00
base_transformer.py [WIP] Changes for training entropy model and correcting attention in local models 2025-01-17 22:21:51 +00:00
checkpoint.py Initial commit 2024-12-12 15:32:30 -08:00
constants.py Initial commit 2024-12-12 15:32:30 -08:00
distributed.py [WIP] Changes for training entropy model and correcting attention in local models 2025-01-17 22:21:51 +00:00
entropy_model.py [WIP] Changes for training entropy model and correcting attention in local models 2025-01-17 22:21:51 +00:00
float8.py Initial commit 2024-12-12 15:32:30 -08:00
logger.py Replace regular filesystem calls with fsspec + add s3 support (#18) 2025-01-10 11:04:41 -08:00
metrics.py Initial commit 2024-12-12 15:32:30 -08:00
optim.py Initial commit 2024-12-12 15:32:30 -08:00
probe.py Initial commit 2024-12-12 15:32:30 -08:00
profiling.py Initial commit 2024-12-12 15:32:30 -08:00
stool.py Initial commit 2024-12-12 15:32:30 -08:00
test_blt.py [WIP] Changes for training entropy model and correcting attention in local models 2025-01-17 22:21:51 +00:00
test_entropy_model.py [WIP] Changes for training entropy model and correcting attention in local models 2025-01-17 22:21:51 +00:00
train.py [WIP] Changes for training entropy model and correcting attention in local models 2025-01-17 22:21:51 +00:00
transformer.py [WIP] Changes for training entropy model and correcting attention in local models 2025-01-17 22:21:51 +00:00