blt/bytelatent
Ink 392117bff2
Some checks are pending
Lint with Black / lint (push) Waiting to run
Lint with isort / lint (push) Waiting to run
Fix realtime entropy patching (#26)
* allow loading of the entropy model directly

* remove unused argument

* remove spammy warning

* allow patch_batch_size to be adjusted in the forward() method

* revert to original patcher style, fix warning

* allow grads when calculating entropies

* fix grad flow

* return preds from calculate_entropies()

* remove legacy arg

* fix an error with monotonicity and small sequence lengths

* ensure patcher is serializable

* revert patcher to original

* remove unused import
2025-01-21 16:34:23 -08:00
..
configs Changes for training entropy model and correcting attention in local models (#25) 2025-01-17 14:23:01 -08:00
data Fix realtime entropy patching (#26) 2025-01-21 16:34:23 -08:00
model Fix realtime entropy patching (#26) 2025-01-21 16:34:23 -08:00
plotting Add plotting code from paper (#17) 2025-01-09 12:11:50 -08:00
preprocess Fix realtime entropy patching (#26) 2025-01-21 16:34:23 -08:00
tokenizers Initial commit 2024-12-12 15:32:30 -08:00
.DS_Store Initial commit 2024-12-12 15:32:30 -08:00
__init__.py Initial commit 2024-12-12 15:32:30 -08:00
args.py Changes for training entropy model and correcting attention in local models (#25) 2025-01-17 14:23:01 -08:00
base_transformer.py Changes for training entropy model and correcting attention in local models (#25) 2025-01-17 14:23:01 -08:00
checkpoint.py Initial commit 2024-12-12 15:32:30 -08:00
constants.py Initial commit 2024-12-12 15:32:30 -08:00
distributed.py Changes for training entropy model and correcting attention in local models (#25) 2025-01-17 14:23:01 -08:00
entropy_model.py Changes for training entropy model and correcting attention in local models (#25) 2025-01-17 14:23:01 -08:00
float8.py Initial commit 2024-12-12 15:32:30 -08:00
logger.py Replace regular filesystem calls with fsspec + add s3 support (#18) 2025-01-10 11:04:41 -08:00
metrics.py Initial commit 2024-12-12 15:32:30 -08:00
optim.py Initial commit 2024-12-12 15:32:30 -08:00
probe.py Initial commit 2024-12-12 15:32:30 -08:00
profiling.py Initial commit 2024-12-12 15:32:30 -08:00
stool.py Initial commit 2024-12-12 15:32:30 -08:00
test_blt.py Changes for training entropy model and correcting attention in local models (#25) 2025-01-17 14:23:01 -08:00
test_entropy_model.py Changes for training entropy model and correcting attention in local models (#25) 2025-01-17 14:23:01 -08:00
train.py Changes for training entropy model and correcting attention in local models (#25) 2025-01-17 14:23:01 -08:00
transformer.py Changes for training entropy model and correcting attention in local models (#25) 2025-01-17 14:23:01 -08:00