Commit graph

  • 2d38422eb0
    Merge 45456fa6d8 into fc3399ef40 Pedro Rodriguez 2025-02-21 17:28:29 -0800
  • de774bd98b
    Merge 203bff3696 into sapling-pr-archive-EntilZha sapling-pr-archive-EntilZha Pedro Rodriguez 2025-02-21 17:27:19 -0800
  • 6df81f25fe
    Merge 203bff3696 into fc3399ef40 Pedro Rodriguez 2025-02-21 17:27:19 -0800
  • 203bff3696 Pass mask in packing_iterator, correctly handle last batch, fix masking pr65 Pedro Rodriguez 2025-02-22 01:23:16 +0000
  • a0fa496aa2 merge commit for archive created by Sapling Pedro Rodriguez 2025-02-22 01:22:31 +0000
  • 1ede87e1ae Pass mask in packing_iterator, correctly handle last batch Pedro Rodriguez 2025-02-22 01:14:31 +0000
  • e8f8a63e60
    Merge 2655e4cf82 into fc3399ef40 Pedro Rodriguez 2025-02-21 17:13:20 -0800
  • c233487b95
    Merge 2655e4cf82 into sapling-pr-archive-EntilZha Pedro Rodriguez 2025-02-21 17:13:18 -0800
  • 2655e4cf82 Remove byte tokenizer and add config args to switch between byte/patch packing pr68 Pedro Rodriguez 2025-02-22 01:13:00 +0000
  • 44b1e5eaa1
    Merge edf86f6689 into sapling-pr-archive-EntilZha Pedro Rodriguez 2025-02-21 17:12:00 -0800
  • edf86f6689 Remove byte tokenizer and add config args to switch between byte/patch packing Pedro Rodriguez 2025-02-22 01:05:58 +0000
  • 62a3ff55bf merge commit for archive created by Sapling Pedro Rodriguez 2025-02-22 00:46:36 +0000
  • eac7a3fdbe Pass mask in packing_iterator, correctly handle last batch Pedro Rodriguez 2025-02-22 00:46:29 +0000
  • fc3399ef40
    Update iterator inheritance, pass file format args, limit iterator (#63) main Pedro Rodriguez 2025-02-21 16:21:07 -0800
  • 92b9a75391 merge commit for archive created by Sapling Pedro Rodriguez 2025-02-21 19:26:36 +0000
  • 3e9de62763 Pass mask in packing_iterator, correctly handle last batch Pedro Rodriguez 2025-02-20 20:15:45 +0000
  • 06a17a0ddc
    Merge 45456fa6d8 into sapling-pr-archive-EntilZha Pedro Rodriguez 2025-02-20 12:16:25 -0800
  • 86abff94d0
    Merge 55ddb0f84b into sapling-pr-archive-EntilZha Pedro Rodriguez 2025-02-20 12:16:14 -0800
  • 45456fa6d8 Add vocab and seq len abstract fields pr66 Pedro Rodriguez 2025-02-20 20:15:45 +0000
  • 55ddb0f84b Pass mask in packing_iterator, correctly handle last batch Pedro Rodriguez 2025-02-20 20:15:45 +0000
  • 8baeef13a1 merge commit for archive created by Sapling Pedro Rodriguez 2025-02-20 00:57:24 +0000
  • 0ffe2ab685 Update iterator inheritance, pass file format args, limit iterator pr63 Pedro Rodriguez 2025-02-20 00:56:52 +0000
  • 3c1c247809
    Merge 2a717d6b40 into sapling-pr-archive-EntilZha Pedro Rodriguez 2025-02-19 16:38:06 -0800
  • 2a717d6b40 Update iterators Pedro Rodriguez 2025-02-20 00:35:04 +0000
  • b0956bde99
    Make apex logs less noisy (#60) Pedro Rodriguez 2025-02-18 10:45:56 -0800
  • 4b57d05c3b
    Merge 2f247263b9 into sapling-pr-archive-EntilZha Pedro Rodriguez 2025-02-18 10:43:12 -0800
  • 2f247263b9 Make apex logs less noisy pr60 Pedro Rodriguez 2025-02-18 18:43:06 +0000
  • 82ab5930ec
    Make it possible to specify multiple config files (#54) Pedro Rodriguez 2025-02-18 10:42:44 -0800
  • 75fd18716e merge commit for archive created by Sapling Pedro Rodriguez 2025-02-18 18:41:21 +0000
  • 3117ac1f1f Make it possible to specify multiple config files pr54 Pedro Rodriguez 2025-02-18 18:41:02 +0000
  • 157ff04867
    Merge 655eca670d into 9f29e0de18 Pedro Rodriguez 2025-02-18 18:23:49 +0000
  • 9f29e0de18
    fix(README): correct typo in quickstart instructions (#62) CharlesCNorton 2025-02-18 12:47:58 -0500
  • de575d24b9
    fix(README): correct typo in quickstart instructions CharlesCNorton 2025-02-17 11:30:07 -0500
  • 1a14267b30
    Update README.md with arxiv citation artidoro-patch-1 Artidoro Pagnoni 2025-02-15 11:50:41 -0800
  • f912535cb7
    Merge 655eca670d into sapling-pr-archive-EntilZha Pedro Rodriguez 2025-02-14 15:46:06 -0800
  • 88dedaa2ec
    Merge a3e0647d03 into sapling-pr-archive-EntilZha Pedro Rodriguez 2025-02-14 15:45:43 -0800
  • 655eca670d Minimal working eval pr46 Pedro Rodriguez 2025-02-14 23:44:54 +0000
  • a3e0647d03 Make apex logs less noisy Pedro Rodriguez 2025-02-14 23:45:11 +0000
  • 52590842e0 merge commit for archive created by Sapling Pedro Rodriguez 2025-02-14 22:51:24 +0000
  • f94babc94e Make it possible to specify multiple config files Pedro Rodriguez 2025-02-14 22:50:23 +0000
  • 018bf98798
    Merge aa78c96ea4 into sapling-pr-archive-EntilZha Pedro Rodriguez 2025-02-14 13:06:55 -0800
  • aa78c96ea4 Make it possible to specify multiple config files Pedro Rodriguez 2025-02-14 21:04:16 +0000
  • ed6300375f
    Merge bec0164820 into sapling-pr-archive-EntilZha Pedro Rodriguez 2025-02-14 13:04:04 -0800
  • bec0164820 Make it possible to specify multiple config files Pedro Rodriguez 2025-02-14 21:03:56 +0000
  • 1c7031b4c4
    Merge be3ff12cfe into sapling-pr-archive-EntilZha Pedro Rodriguez 2025-02-14 13:03:38 -0800
  • be3ff12cfe Make it possible to specify multiple config files Pedro Rodriguez 2025-02-14 21:03:25 +0000
  • f3e8125f74
    using apex rmsnorm (#57) Srinivasan Iyer 2025-02-14 11:22:03 -0800
  • 89deebc8f3 missed a print apex_rmsnorm Srini Iyer 2025-02-14 19:21:28 +0000
  • 8be8c85a63 black Srini Iyer 2025-02-14 19:20:17 +0000
  • c49e25171e
    Update README.md (#58) Srinivasan Iyer 2025-02-14 11:16:49 -0800
  • da2cf02179 added message for missing apex Srini Iyer 2025-02-14 19:15:22 +0000
  • 8c61ab5e67
    Fix multiprocessing dataloader checkpointing and use it in the train script (#50) Pedro Rodriguez 2025-02-13 11:58:23 -0800
  • 67b6c3b3da
    Update README.md sriniiyer-patch-1 Srinivasan Iyer 2025-02-13 11:57:49 -0800
  • d3bf3a1383 using apex rmsnorm Srini Iyer 2025-02-13 19:46:06 +0000
  • 84afa0f121 merge commit for archive created by Sapling Pedro Rodriguez 2025-02-13 19:01:55 +0000
  • 53529dcc78 Fix multiprocessing dataloader checkpointing and use it in the train script pr50 Pedro Rodriguez 2025-02-13 19:01:48 +0000
  • 76e7b001bb
    Merge 0c6cb995a0 into sapling-pr-archive-EntilZha Pedro Rodriguez 2025-02-13 10:39:03 -0800
  • 0c6cb995a0 Fix multiprocessing dataloader checkpointing and use it in the train script Pedro Rodriguez 2025-02-13 18:38:58 +0000
  • 85c2f28f26
    Test first batch matches (#53) Pedro Rodriguez 2025-02-13 10:05:08 -0800
  • 45d52b7ae3
    Merge ab8f8a4412 into sapling-pr-archive-EntilZha Pedro Rodriguez 2025-02-13 10:04:43 -0800
  • ab8f8a4412 Test first batch matches pr53 Pedro Rodriguez 2025-02-13 18:04:30 +0000
  • 9d907fed1c
    disable reshard after forward (#56) Srinivasan Iyer 2025-02-12 18:33:53 -0800
  • 67624845d0 disable reshard after forward disable_raf Srini Iyer 2025-02-13 00:58:55 +0000
  • 48e4ad0bd2
    make sure max_encoder_seq_length matches (#55) Srinivasan Iyer 2025-02-12 18:27:22 -0800
  • 00c7a6f194 black and assert comment assert_seq Srini Iyer 2025-02-13 02:26:46 +0000
  • 0ce2cd45ef make sure max_encoder_seq_length matches Srini Iyer 2025-02-13 00:52:08 +0000
  • 078791996f
    Merge ece82cb960 into sapling-pr-archive-EntilZha Pedro Rodriguez 2025-02-12 11:25:18 -0800
  • ece82cb960 Make it possible to specify multiple config files Pedro Rodriguez 2025-02-12 19:24:49 +0000
  • 15d9c40abe
    Merge 3e3193c1d4 into sapling-pr-archive-EntilZha Pedro Rodriguez 2025-02-12 10:24:54 -0800
  • c0c5bdba91
    Merge c54c9f0517 into sapling-pr-archive-EntilZha Pedro Rodriguez 2025-02-12 10:24:45 -0800
  • 3e3193c1d4 Fix multiprocessing dataloader checkpointing and use it in the train script Pedro Rodriguez 2025-02-12 18:24:40 +0000
  • c54c9f0517 Test first batch matches Pedro Rodriguez 2025-02-12 18:07:21 +0000
  • ec59c13d81
    Merge bd3cf61bb9 into sapling-pr-archive-EntilZha Pedro Rodriguez 2025-02-12 10:11:44 -0800
  • 9613e0ea5f merge commit for archive created by Sapling Pedro Rodriguez 2025-02-12 18:09:31 +0000
  • bd3cf61bb9 Fix multiprocessing dataloader checkpointing and use it in the train script Pedro Rodriguez 2025-02-12 18:09:26 +0000
  • 4cee32ea8c Test first batch matches Pedro Rodriguez 2025-02-12 18:07:21 +0000
  • b61a612bbb
    Merge 92af9b3f56 into sapling-pr-archive-EntilZha Pedro Rodriguez 2025-02-12 10:07:50 -0800
  • 92af9b3f56 Test first batch matches Pedro Rodriguez 2025-02-12 18:07:21 +0000
  • c6cbacc8c1 merge commit for archive created by Sapling Pedro Rodriguez 2025-02-11 22:56:32 +0000
  • 38cc67a953 Fix multiprocessing dataloader checkpointing and use it in the train script Pedro Rodriguez 2025-02-11 22:56:25 +0000
  • c4b7a01b2b
    Merge 5c8fb4f1b3 into sapling-pr-archive-EntilZha Pedro Rodriguez 2025-02-07 15:27:12 -0800
  • 5c8fb4f1b3 Fix multiprocessing dataloader checkpointing and use it in the train script Pedro Rodriguez 2025-02-07 23:26:48 +0000
  • 22c7fe1d1c
    fix save and reload model state (#49) Srinivasan Iyer 2025-02-07 14:27:47 -0800
  • 3075d7bf83 fix save and reload model state fix_reload_state Srini Iyer 2025-02-07 21:46:34 +0000
  • fe45f69fbf
    Add bpb and n_bytes to metric logging (#41) Pedro Rodriguez 2025-02-07 13:14:30 -0800
  • b35206d756 merge commit for archive created by Sapling Pedro Rodriguez 2025-02-07 21:13:42 +0000
  • 8d7338308e Add bpb and n_bytes to metric logging pr41 Pedro Rodriguez 2025-02-07 00:26:00 +0000
  • 7cc123b4ac
    Merge 45bfe94c1e into aebdc481a8 Pedro Rodriguez 2025-02-08 00:48:03 +0800
  • f783846574 merge commit for archive created by Sapling Pedro Rodriguez 2025-02-07 00:26:06 +0000
  • b6396eb0f4 Add bpb and n_bytes to metric logging Pedro Rodriguez 2025-02-07 00:26:00 +0000
  • aebdc481a8
    Fix init and repro (#48) Srinivasan Iyer 2025-02-06 14:18:02 -0800
  • ba922695b3 comment + black fix_repro Srini Iyer 2025-02-06 22:14:20 +0000
  • 30f82211c4 Fix init and repro Srini Iyer 2025-02-06 20:01:32 +0000
  • ab594996a9
    Merge 4e2ed0aa05 into sapling-pr-archive-EntilZha Pedro Rodriguez 2025-02-06 10:24:36 -0800
  • 4e2ed0aa05 Add bpb and n_bytes to metric logging Pedro Rodriguez 2025-02-06 18:08:01 +0000
  • 936d9437be
    Allow ArrowIterator to read from json (#45) Pedro Rodriguez 2025-02-06 09:57:22 -0800
  • 2950b63cf2
    Merge 9c3c997cae into sapling-pr-archive-EntilZha Pedro Rodriguez 2025-02-06 09:44:41 -0800
  • 9c3c997cae Allow ArrowIterator to read from json pr45 Pedro Rodriguez 2025-02-06 17:43:20 +0000
  • fff80b86b5
    Merge 0e9421af07 into sapling-pr-archive-EntilZha Pedro Rodriguez 2025-02-06 09:43:15 -0800
  • 0e9421af07 Allow ArrowIterator to read from json Pedro Rodriguez 2025-02-06 17:43:10 +0000