Default branch

e299427ae4 · Cast int sample id to str () · Updated 2025-04-09 00:45:45 +00:00

Branches

8212e9b6f2 · fix stool · Updated 2025-02-06 00:55:25 +00:00

40
1

b28ceb624d · Add flag for rope outer in fp32 · Updated 2025-02-06 00:40:51 +00:00

40
2

a27ab3de8e · Fix wandb logging · Updated 2025-02-06 00:07:59 +00:00

41
1

ac257bac19 · Fix distributed all reduce grad norm · Updated 2025-02-05 00:52:52 +00:00

42
1

11cad6c84d · WIP parallel copy script · Updated 2025-01-28 00:57:06 +00:00

42
1

caf82b924e · This includes fixes that make checkpointing and reloading work correctly. · Updated 2025-01-28 00:54:47 +00:00

43
1

34ca1f7d4b · Initial codes and scripts for training entropy model · Updated 2025-01-24 21:59:42 +00:00

44
1

bd461af91a · Use load_async flag to not start MP iterator · Updated 2025-01-24 18:56:28 +00:00

45
1

8a3084c346 · Update file check script to check sizes · Updated 2025-01-22 19:58:04 +00:00

46
1

7f305b3871 · [WIP] Changes for training entropy model and correcting attention in local models · Updated 2025-01-17 22:21:51 +00:00

48
1

d718cfa9a1 · Update preprocess_entropies script to blt inference + add fsspec support · Updated 2025-01-13 23:26:27 +00:00

50
1

a1d05403b4 · Replace regular filesystem calls with fsspec + add s3 support · Updated 2025-01-10 01:04:18 +00:00

51
1

28016f144d · Add plotting code from paper · Updated 2025-01-09 20:06:26 +00:00

52
1