blt/fixtures/test_docs.jsonl at main - vikarti.anatra/blt - VRR Forge

vikarti.anatra/blt

mirror of https://github.com/facebookresearch/blt.git synced 2025-02-22 13:02:14 +00:00

Pedro Rodriguez fc3399ef40

Lint with Black / lint (push) Waiting to run

Details

Lint with isort / lint (push) Waiting to run

Details

Update iterator inheritance, pass file format args, limit iterator (#63 )

- Create a common class to use in all inheritance for states
- Add a limit iterator that we can use in evals
- Modify ArrowFileIterator behavior to not do arrow path inference if file_format='json'
- Make EvalArgs valid
- Move testing iterators to a common directory to allow usage in multiple test files
- Make it so that SequenceIterator can take a None rng_state, to disable all rng ops (for eval mainly)

Test Plan:

- `pytest bytelatent`
- `python -m bytelatent.train config=../internal-blt/configs/entropy_model.yaml logging.wandb=null eval=null`

2025-02-21 16:21:07 -08:00

4 lines

111 B

JSON

Raw Permalink Blame History

	`{"sample_id": "0", "text": "test_0"}`
	`{"sample_id": "1", "text": "test_1"}`
	`{"sample_id": "2", "text": "test_2"}`