Oumi

Latest version: v0.1.10

0.1.4

What's Changed
* Add memory cleanup calls in e2e integration tests by xrdaukar in https://github.com/oumi-ai/oumi/pull/1277
* Set up versioning for our documentation by taenin in https://github.com/oumi-ai/oumi/pull/1275
* Make `qwen2-VL` evaluation job pass by xrdaukar in https://github.com/oumi-ai/oumi/pull/1278
* Add multi-modal (vlm) notebook with Llama 11B by optas in https://github.com/oumi-ai/oumi/pull/1258
* Documentation: Inference -> List supported models by kaisopos in https://github.com/oumi-ai/oumi/pull/1279
* [tiny] update website link by oelachqar in https://github.com/oumi-ai/oumi/pull/1280
* Update all documentation links to the new doc URL by taenin in https://github.com/oumi-ai/oumi/pull/1281
* Update Oumi - A Tour.ipynb by brragorn in https://github.com/oumi-ai/oumi/pull/1282
* Documentation: Judge (minor edits) by kaisopos in https://github.com/oumi-ai/oumi/pull/1283
* Fix citation by oelachqar in https://github.com/oumi-ai/oumi/pull/1285
* Add Deepseek R1 1.5B/32B configs by wizeng23 in https://github.com/oumi-ai/oumi/pull/1276
* Misc eval configs cleanup by xrdaukar in https://github.com/oumi-ai/oumi/pull/1286
* [docs] Describe parallel evaluation by xrdaukar in https://github.com/oumi-ai/oumi/pull/1284
* Update `microsoft/Phi-3-vision-128k-instruct` training config by xrdaukar in https://github.com/oumi-ai/oumi/pull/1287
* Add Together Deepseek R1 inference config by wizeng23 in https://github.com/oumi-ai/oumi/pull/1289
* [minor] vlm notebook minor updates (doc referencing, freeze visual backbone) by optas in https://github.com/oumi-ai/oumi/pull/1288
* Add missing `-m oumi evaluate` argument in eval config by xrdaukar in https://github.com/oumi-ai/oumi/pull/1291
* [docs] Add more references to VL-SFT and SFT notebooks by xrdaukar in https://github.com/oumi-ai/oumi/pull/1293
* Eval config change for `deepseek-ai/DeepSeek-R1-Distill-Llama-70B` by xrdaukar in https://github.com/oumi-ai/oumi/pull/1292
* [notebooks] Update intro & installation instruction by oelachqar in https://github.com/oumi-ai/oumi/pull/1294
* Update notebook intros by oelachqar in https://github.com/oumi-ai/oumi/pull/1296
* [notebooks] Update installation instructions for colab by oelachqar in https://github.com/oumi-ai/oumi/pull/1297
* Add Apache license header to `src/oumi/**/*.py` by wizeng23 in https://github.com/oumi-ai/oumi/pull/1290
* Minor updates to VLM Multimodal notebook by xrdaukar in https://github.com/oumi-ai/oumi/pull/1299
* [docs] Add latest notebooks and update references by oelachqar in https://github.com/oumi-ai/oumi/pull/1300
* [tiny] Add docs auto-generated `.rst` files to gitignore by wizeng23 in https://github.com/oumi-ai/oumi/pull/1298
* [tiny] use GitHub link for header by oelachqar in https://github.com/oumi-ai/oumi/pull/1301
* [docs][tiny] update inference engines reference by oelachqar in https://github.com/oumi-ai/oumi/pull/1302
* Update README/docs to add new DeepSeek models by wizeng23 in https://github.com/oumi-ai/oumi/pull/1304
* [docs] Use `pip install oumi` over `pip install .` by wizeng23 in https://github.com/oumi-ai/oumi/pull/1305
* Tune VLM SFT configs by xrdaukar in https://github.com/oumi-ai/oumi/pull/1306
* Tune VLM configs for SmolVLM and Qwen2-VL by xrdaukar in https://github.com/oumi-ai/oumi/pull/1307
* Update config/notebook pip installs to use PyPI by wizeng23 in https://github.com/oumi-ai/oumi/pull/1308
* [tiny] upgrade torch version by oelachqar in https://github.com/oumi-ai/oumi/pull/1295
* Update logging and unit tests related to chat templates by xrdaukar in https://github.com/oumi-ai/oumi/pull/1311
* fix(docs): "interested by joining" to "interested in joining" by CharlesCNorton in https://github.com/oumi-ai/oumi/pull/1312
* Add HF_TOKEN instructions to Oumi Multimodal notebook by xrdaukar in https://github.com/oumi-ai/oumi/pull/1313
* Update configuration.md by penfever in https://github.com/oumi-ai/oumi/pull/1314
* remove duplicate keys in config example by lucyknada in https://github.com/oumi-ai/oumi/pull/1315
* [Notebooks] Update VLM notebook by xrdaukar in https://github.com/oumi-ai/oumi/pull/1317
* Update parasail_inference_engine.py by jgreer013 in https://github.com/oumi-ai/oumi/pull/1320
* Fix typo and update warning message for OUMI trainer by xrdaukar in https://github.com/oumi-ai/oumi/pull/1319
* [Notebooks] Add a note that a notebook kernel restart may be needed after `pip install oumi` by xrdaukar in https://github.com/oumi-ai/oumi/pull/1318
* Update Phi3 to support multiple images by xrdaukar in https://github.com/oumi-ai/oumi/pull/1321
* Add more detailed comment headers to YAML configs by wizeng23 in https://github.com/oumi-ai/oumi/pull/1310
* [Notebooks] Add a note to Tour notebook to restart kernel after the first `pip install` by xrdaukar in https://github.com/oumi-ai/oumi/pull/1327
* Tweak `--mem-fraction-static` param in sample SGLang configs by xrdaukar in https://github.com/oumi-ai/oumi/pull/1328
* Disallow using `DatasetParams` field names as keys in `DatasetParams.dataset_kwargs` by xrdaukar in https://github.com/oumi-ai/oumi/pull/1324
* Support `dataset_name_override` dataset_kwarg by xrdaukar in https://github.com/oumi-ai/oumi/pull/1188
* Add an util and a test marker for HF token by xrdaukar in https://github.com/oumi-ai/oumi/pull/1329
* Update `llama3-instruct` chat template to align with the original models template by xrdaukar in https://github.com/oumi-ai/oumi/pull/1326
* chore: update launcher.sh by eltociear in https://github.com/oumi-ai/oumi/pull/1333
* [Notebooks] Minor improvements in VLM and CNN notebooks by xrdaukar in https://github.com/oumi-ai/oumi/pull/1335
* Update VLM cluster names in sample commands by xrdaukar in https://github.com/oumi-ai/oumi/pull/1336
* Update our README and docs with the github trending badge. by taenin in https://github.com/oumi-ai/oumi/pull/1340
* Update README.md - Add DeepSeek to supported models by mkoukoumidis in https://github.com/oumi-ai/oumi/pull/1343
* Update index.md - Add DeepSeek to supported models by mkoukoumidis in https://github.com/oumi-ai/oumi/pull/1344
* Update "GPU Tests" status badge in README page by xrdaukar in https://github.com/oumi-ai/oumi/pull/1345

New Contributors
* CharlesCNorton made their first contribution in https://github.com/oumi-ai/oumi/pull/1312
* lucyknada made their first contribution in https://github.com/oumi-ai/oumi/pull/1315
* eltociear made their first contribution in https://github.com/oumi-ai/oumi/pull/1333

**Full Changelog**: https://github.com/oumi-ai/oumi/compare/v0.1.3...v0.1.4

0.1.3

What's Changed
* Documentation: Judge | Custom Model page by kaisopos in https://github.com/oumi-ai/oumi/pull/1195
* [WIP] Add a notebook for using CNN with custom dataset by xrdaukar in https://github.com/oumi-ai/oumi/pull/1196
* [Cherrypick for launch] Evaluate: return dict of results by kaisopos in https://github.com/oumi-ai/oumi/pull/1197
* Configs Train/Infer/Eval and Llama 3.3v (70b) by optas in https://github.com/oumi-ai/oumi/pull/1200
* Adding an integration test for evaluation fn's output (see PR-1197) by kaisopos in https://github.com/oumi-ai/oumi/pull/1199
* [docs] Add more details and cross-references related to customization by xrdaukar in https://github.com/oumi-ai/oumi/pull/1198
* Define `single_gpu` test marker by xrdaukar in https://github.com/oumi-ai/oumi/pull/1201
* Native inference: Don't set `min_p`, `temperature` in `GenerationConfig` if sampling is disabled by xrdaukar in https://github.com/oumi-ai/oumi/pull/1202
* Update tests to make them runnable on GCP by xrdaukar in https://github.com/oumi-ai/oumi/pull/1203
* Add newline before `pformat(train_config)` by xrdaukar in https://github.com/oumi-ai/oumi/pull/1204
* GCP tests launcher script changes by xrdaukar in https://github.com/oumi-ai/oumi/pull/1205
* [Evaluation] Bug: serialization by kaisopos in https://github.com/oumi-ai/oumi/pull/1207
* [docs] Add inference snippet for together.ai and DeepSeek APIs by oelachqar in https://github.com/oumi-ai/oumi/pull/1208
* Exclude `multi_gpu` tests from GitHub GPU tests by xrdaukar in https://github.com/oumi-ai/oumi/pull/1210
* Update e2e tests to support multi-GPU machines by xrdaukar in https://github.com/oumi-ai/oumi/pull/1206
* Add wrappers for remote inference engines by oelachqar in https://github.com/oumi-ai/oumi/pull/1209
* Vision-Lang & Inference (including LoRA) by optas in https://github.com/oumi-ai/oumi/pull/1174
* [BugFix] Throw a runtime error for quantized models & inference=VLLM by kaisopos in https://github.com/oumi-ai/oumi/pull/1212
* Fix most job configs by wizeng23 in https://github.com/oumi-ai/oumi/pull/1213
* e2e tests update by xrdaukar in https://github.com/oumi-ai/oumi/pull/1216
* [Notebook] Evaluation with Oumi by kaisopos in https://github.com/oumi-ai/oumi/pull/1218
* gpt2: move `include_performance_metrics` param from script to yaml by xrdaukar in https://github.com/oumi-ai/oumi/pull/1217
* Simplify inference engine API by oelachqar in https://github.com/oumi-ai/oumi/pull/1214
* Move configs to experimental by wizeng23 in https://github.com/oumi-ai/oumi/pull/1215
* [docs] Update index page by oelachqar in https://github.com/oumi-ai/oumi/pull/1220
* Update ConsoleLogger to write to STDOUT by xrdaukar in https://github.com/oumi-ai/oumi/pull/1221
* Set `use_spot` to False in our JobConfigs by wizeng23 in https://github.com/oumi-ai/oumi/pull/1222
* Delete `oumi[optional]` install target by wizeng23 in https://github.com/oumi-ai/oumi/pull/1224
* Scaffolding and the first testcase for e2e evaluation tests by xrdaukar in https://github.com/oumi-ai/oumi/pull/1225
* [docs] Update inference engines doc page by oelachqar in https://github.com/oumi-ai/oumi/pull/1227
* Clean-up inference engine builder by oelachqar in https://github.com/oumi-ai/oumi/pull/1226
* [VLLM Engine] Enabling BitsAndBytes quantization by kaisopos in https://github.com/oumi-ai/oumi/pull/1223
* Add example distillation notebook by jgreer013 in https://github.com/oumi-ai/oumi/pull/1228
* Add a script to pre-download models for `gpu_tests` by xrdaukar in https://github.com/oumi-ai/oumi/pull/1231
* Fix multi-GPU inference integration test by xrdaukar in https://github.com/oumi-ai/oumi/pull/1229
* [tiny][docs] Update PEFT/LoRA content by optas in https://github.com/oumi-ai/oumi/pull/1233
* [BugFix] GGUF does not work with VLLM by kaisopos in https://github.com/oumi-ai/oumi/pull/1232
* Re-enable parallel evaluation for VLM-s by xrdaukar in https://github.com/oumi-ai/oumi/pull/1235
* Add multimodal exemplar dataset in our provided mini-datasets by optas in https://github.com/oumi-ai/oumi/pull/1234
* [Tiny] renaming a field name (`init_lora_weights`) by optas in https://github.com/oumi-ai/oumi/pull/1236
* Add more e2e evaluation tests by xrdaukar in https://github.com/oumi-ai/oumi/pull/1237
* Fix pyright breakage when vllm and llama_cpp are not installed by taenin in https://github.com/oumi-ai/oumi/pull/1240
* Update our oumi launch documentation. by taenin in https://github.com/oumi-ai/oumi/pull/1239
* Update index.md title for "Join the Community!" by mkoukoumidis in https://github.com/oumi-ai/oumi/pull/1242
* Update quickstart.md - nit for Oumi support request by mkoukoumidis in https://github.com/oumi-ai/oumi/pull/1241
* [VLLM Engine] Improve support for GGUF models (incl. auto-download) by kaisopos in https://github.com/oumi-ai/oumi/pull/1238
* Update README.md title to "Join the Community!" by mkoukoumidis in https://github.com/oumi-ai/oumi/pull/1243
* Update quickstart.md by brragorn in https://github.com/oumi-ai/oumi/pull/1251
* Update quickstart.md by brragorn in https://github.com/oumi-ai/oumi/pull/1253
* Update quickstart.md by brragorn in https://github.com/oumi-ai/oumi/pull/1252
* Update quickstart.md by brragorn in https://github.com/oumi-ai/oumi/pull/1250
* [Minor refactor] Moving model caching to `oumi.utils` by kaisopos in https://github.com/oumi-ai/oumi/pull/1246
* Add more details to troubleshooting FAQ by wizeng23 in https://github.com/oumi-ai/oumi/pull/1249
* Update training_methods.md - Change compute requirement suggestions by mkoukoumidis in https://github.com/oumi-ai/oumi/pull/1245
* Update train.md - nit description change by mkoukoumidis in https://github.com/oumi-ai/oumi/pull/1244
* [docs] misc docs feedback by oelachqar in https://github.com/oumi-ai/oumi/pull/1248
* [tiny] Qwen2-VL activate experimental datapipes by optas in https://github.com/oumi-ai/oumi/pull/1247
* Update Oumi - A Tour.ipynb by brragorn in https://github.com/oumi-ai/oumi/pull/1254
* [docs] more docs feedback by oelachqar in https://github.com/oumi-ai/oumi/pull/1255
* Update supported_models.md by penfever in https://github.com/oumi-ai/oumi/pull/1256
* Rename `experimental_use_torch_datapipes` data param by xrdaukar in https://github.com/oumi-ai/oumi/pull/1257
* Add pypi release workflow using testpypi by oelachqar in https://github.com/oumi-ai/oumi/pull/1259
* Update workflow names by oelachqar in https://github.com/oumi-ai/oumi/pull/1262
* Update default idle_minutes_to_autostop to 1 hour. by taenin in https://github.com/oumi-ai/oumi/pull/1264
* update pypi release workflow to use trusted env by oelachqar in https://github.com/oumi-ai/oumi/pull/1265
* Add `padding_side` param to internal model config by xrdaukar in https://github.com/oumi-ai/oumi/pull/1260
* Documentation: Updates on Evaluation/Judge (based on Manos' feedback) by kaisopos in https://github.com/oumi-ai/oumi/pull/1261
* [tiny] less strict requirements by oelachqar in https://github.com/oumi-ai/oumi/pull/1266
* Add Deepseek R1 Distill Llama 8B/70B configs by wizeng23 in https://github.com/oumi-ai/oumi/pull/1263
* Update index.md to highlight beta stage by mkoukoumidis in https://github.com/oumi-ai/oumi/pull/1268
* Update README.md to highlight beta stage by mkoukoumidis in https://github.com/oumi-ai/oumi/pull/1267
* Disable pre-release packages by oelachqar in https://github.com/oumi-ai/oumi/pull/1270
* Update common_workflows.md - Clarify OpenAI is just an example by mkoukoumidis in https://github.com/oumi-ai/oumi/pull/1271
* Documentation: Evaluation page (update to highlight multi-modal) by kaisopos in https://github.com/oumi-ai/oumi/pull/1269
* Update launch.md by taenin in https://github.com/oumi-ai/oumi/pull/1272
* Add pypi release workflow by oelachqar in https://github.com/oumi-ai/oumi/pull/1273
* Documentation: Judge | minor edit (bold) by kaisopos in https://github.com/oumi-ai/oumi/pull/1274


**Full Changelog**: https://github.com/oumi-ai/oumi/compare/v0.1.2...v0.1.3

0.1.2.3

What's Changed
* Re-enable parallel evaluation for VLM-s by xrdaukar in https://github.com/oumi-ai/oumi/pull/1235
* Add multimodal exemplar dataset in our provided mini-datasets by optas in https://github.com/oumi-ai/oumi/pull/1234
* [Tiny] renaming a field name (`init_lora_weights`) by optas in https://github.com/oumi-ai/oumi/pull/1236
* Add more e2e evaluation tests by xrdaukar in https://github.com/oumi-ai/oumi/pull/1237
* Fix pyright breakage when vllm and llama_cpp are not installed by taenin in https://github.com/oumi-ai/oumi/pull/1240
* Update our oumi launch documentation. by taenin in https://github.com/oumi-ai/oumi/pull/1239
* Update index.md title for "Join the Community!" by mkoukoumidis in https://github.com/oumi-ai/oumi/pull/1242
* Update quickstart.md - nit for Oumi support request by mkoukoumidis in https://github.com/oumi-ai/oumi/pull/1241
* [VLLM Engine] Improve support for GGUF models (incl. auto-download) by kaisopos in https://github.com/oumi-ai/oumi/pull/1238
* Update README.md title to "Join the Community!" by mkoukoumidis in https://github.com/oumi-ai/oumi/pull/1243
* Update quickstart.md by brragorn in https://github.com/oumi-ai/oumi/pull/1251
* Update quickstart.md by brragorn in https://github.com/oumi-ai/oumi/pull/1253
* Update quickstart.md by brragorn in https://github.com/oumi-ai/oumi/pull/1252
* Update quickstart.md by brragorn in https://github.com/oumi-ai/oumi/pull/1250
* [Minor refactor] Moving model caching to `oumi.utils` by kaisopos in https://github.com/oumi-ai/oumi/pull/1246
* Add more details to troubleshooting FAQ by wizeng23 in https://github.com/oumi-ai/oumi/pull/1249
* Update training_methods.md - Change compute requirement suggestions by mkoukoumidis in https://github.com/oumi-ai/oumi/pull/1245
* Update train.md - nit description change by mkoukoumidis in https://github.com/oumi-ai/oumi/pull/1244
* [docs] misc docs feedback by oelachqar in https://github.com/oumi-ai/oumi/pull/1248
* [tiny] Qwen2-VL activate experimental datapipes by optas in https://github.com/oumi-ai/oumi/pull/1247
* Update Oumi - A Tour.ipynb by brragorn in https://github.com/oumi-ai/oumi/pull/1254
* [docs] more docs feedback by oelachqar in https://github.com/oumi-ai/oumi/pull/1255
* Update supported_models.md by penfever in https://github.com/oumi-ai/oumi/pull/1256
* Rename `experimental_use_torch_datapipes` data param by xrdaukar in https://github.com/oumi-ai/oumi/pull/1257
* Add pypi release workflow using testpypi by oelachqar in https://github.com/oumi-ai/oumi/pull/1259
* Update workflow names by oelachqar in https://github.com/oumi-ai/oumi/pull/1262
* Update default idle_minutes_to_autostop to 1 hour. by taenin in https://github.com/oumi-ai/oumi/pull/1264
* update pypi release workflow to use trusted env by oelachqar in https://github.com/oumi-ai/oumi/pull/1265


**Full Changelog**: https://github.com/oumi-ai/oumi/compare/v0.1.2.2...v0.1.2.3

0.1.2alpha

What's Changed
* Update README.md - Better highlight features & nits by mkoukoumidis in https://github.com/oumi-ai/oumi/pull/995
* [tiny] update docstring and cleanup by oelachqar in https://github.com/oumi-ai/oumi/pull/1006
* `Qwen2-VL`: minor updates by xrdaukar in https://github.com/oumi-ai/oumi/pull/1000
* Update README.md - Describe Oumi's most common capabilities by mkoukoumidis in https://github.com/oumi-ai/oumi/pull/996
* Fix readme. by taenin in https://github.com/oumi-ai/oumi/pull/1009
* Updated our ascii logo by taenin in https://github.com/oumi-ai/oumi/pull/1008
* [docs] Update readme by oelachqar in https://github.com/oumi-ai/oumi/pull/1010
* Cleanup scripts by oelachqar in https://github.com/oumi-ai/oumi/pull/1011
* Cleanup experimental folder by oelachqar in https://github.com/oumi-ai/oumi/pull/1012
* Update lists of supported VLM-s in README and docs by xrdaukar in https://github.com/oumi-ai/oumi/pull/1014
* Freeze Python package versions by xrdaukar in https://github.com/oumi-ai/oumi/pull/1007
* Update `blip2`'s chat template to use the "default" one by xrdaukar in https://github.com/oumi-ai/oumi/pull/1015
* Add docstrings how to start vLLM and SGLang servers for `Llama-3.2-11B-Vision-Instruct` by xrdaukar in https://github.com/oumi-ai/oumi/pull/1016
* Evaluation: bugfixing, corner case, unit tests by kaisopos in https://github.com/oumi-ai/oumi/pull/1003
* Configure `asyncio_default_fixture_loop_scope` to reduce pytest warnings by xrdaukar in https://github.com/oumi-ai/oumi/pull/1013
* Update the registry to load registered core values upon use. by taenin in https://github.com/oumi-ai/oumi/pull/1017
* Update default installation instructions to pypi by taenin in https://github.com/oumi-ai/oumi/pull/1018
* [tiny] Update debug datasets by oelachqar in https://github.com/oumi-ai/oumi/pull/1020
* [docs] Address misc docs feedback by oelachqar in https://github.com/oumi-ai/oumi/pull/1019
* [tiny] update pre-defined judges and docs by oelachqar in https://github.com/oumi-ai/oumi/pull/1021
* Parameterize e2e training test, and add config for `Qwen2-VL` by xrdaukar in https://github.com/oumi-ai/oumi/pull/1023
* Remove our docs password from the readme. by taenin in https://github.com/oumi-ai/oumi/pull/1024
* VLM docs update by xrdaukar in https://github.com/oumi-ai/oumi/pull/1025
* Fix loading registered pretrain datasets by wizeng23 in https://github.com/oumi-ai/oumi/pull/1005
* Update `requires_gpus` test decorator to optionally specify min GPU memory requirement by xrdaukar in https://github.com/oumi-ai/oumi/pull/1029
* [tiny] Update GitHub workflows by oelachqar in https://github.com/oumi-ai/oumi/pull/1034
* Update `BaseConfig.from_yaml` to also support Path by xrdaukar in https://github.com/oumi-ai/oumi/pull/1026
* [tiny] Cleanup judge engine builder & fix circular dep by oelachqar in https://github.com/oumi-ai/oumi/pull/1035
* Create GPU GitHub Actions workflow by oelachqar in https://github.com/oumi-ai/oumi/pull/1004
* Add structured outputs support to gemini/vertex engines by oelachqar in https://github.com/oumi-ai/oumi/pull/1022
* [docs] Fix feedback on training and inference user guides by oelachqar in https://github.com/oumi-ai/oumi/pull/1037
* [docs][tiny] fix examples in inference guide by oelachqar in https://github.com/oumi-ai/oumi/pull/1038
* Add a sanity test for circular imports. by taenin in https://github.com/oumi-ai/oumi/pull/1030
* Resolve circular dependencies in Oumi by taenin in https://github.com/oumi-ai/oumi/pull/1039
* Move our circular dependency test to e2e to speed up GPU CI tests. by taenin in https://github.com/oumi-ai/oumi/pull/1040
* Add custom inference engine for gemini API by oelachqar in https://github.com/oumi-ai/oumi/pull/1036
* Define CLI in our quickstart. by taenin in https://github.com/oumi-ai/oumi/pull/1042
* Skip running GPU tests on low-risk code paths by oelachqar in https://github.com/oumi-ai/oumi/pull/1043
* Define more terms in our training docs. by taenin in https://github.com/oumi-ai/oumi/pull/1044
* Fix the broken python text snippet on the train page. by taenin in https://github.com/oumi-ai/oumi/pull/1045
* Fix the second python snippet in the train page. by taenin in https://github.com/oumi-ai/oumi/pull/1046
* [docs] Add Gemini to the list of supported inference API-s, and sort them by xrdaukar in https://github.com/oumi-ai/oumi/pull/1048
* Fix issues in most notebooks by wizeng23 in https://github.com/oumi-ai/oumi/pull/1047
* [docs][tiny] remove termynal from sphinx conf by oelachqar in https://github.com/oumi-ai/oumi/pull/1041
* Fix a typo in the VS Code environment page. by taenin in https://github.com/oumi-ai/oumi/pull/1049
* Define WSL in our vscode docs. by taenin in https://github.com/oumi-ai/oumi/pull/1052
* [tiny] disable unit tests on safe paths by oelachqar in https://github.com/oumi-ai/oumi/pull/1051
* [docs] Fix contributing and open issue links by oelachqar in https://github.com/oumi-ai/oumi/pull/1050
* [evaluations/generative_benchmark] Broken link by kaisopos in https://github.com/oumi-ai/oumi/pull/1054
* Remove dangling reference to `jupyter` in Makefile help by xrdaukar in https://github.com/oumi-ai/oumi/pull/1053
* [evaluations/generative_benchmark] Removing notebook link by kaisopos in https://github.com/oumi-ai/oumi/pull/1055
* Support constrained decoding in SGLang inference engine by xrdaukar in https://github.com/oumi-ai/oumi/pull/1032
* [tiny] Update tutorials page by wizeng23 in https://github.com/oumi-ai/oumi/pull/1056
* Minor updates to Launch.md by taenin in https://github.com/oumi-ai/oumi/pull/1059
* [docs] Update docs/user_guides/infer/infer.md by xrdaukar in https://github.com/oumi-ai/oumi/pull/1058
* Nits for common_workflows.md by mkoukoumidis in https://github.com/oumi-ai/oumi/pull/1061
* Nit fixes for acknowledgements.md by mkoukoumidis in https://github.com/oumi-ai/oumi/pull/1057
* Add sample troubleshooting for remote jobs. by taenin in https://github.com/oumi-ai/oumi/pull/1062
* Add a Github Issues selector for questions and have it redirect to Discord. by taenin in https://github.com/oumi-ai/oumi/pull/1064
* Package checking: Adding functionality for checking package versioning and fast failing by kaisopos in https://github.com/oumi-ai/oumi/pull/1031
* Fix various typos in contributing.md by taenin in https://github.com/oumi-ai/oumi/pull/1066
* SGLang inference documentation by xrdaukar in https://github.com/oumi-ai/oumi/pull/1065
* Replace assert in `NativeInferenceEngine` with `RuntimeError` by xrdaukar in https://github.com/oumi-ai/oumi/pull/1068
* Update dev set up instructions to use a Fork. by taenin in https://github.com/oumi-ai/oumi/pull/1067
* Define inference configs for more models by xrdaukar in https://github.com/oumi-ai/oumi/pull/1069
* [Evaluation] HF Leaderboards yaml files by kaisopos in https://github.com/oumi-ai/oumi/pull/1071
* Specify `engine: NATIVE` in inference configs by xrdaukar in https://github.com/oumi-ai/oumi/pull/1075
* Improve handling of image path and URLs by xrdaukar in https://github.com/oumi-ai/oumi/pull/1074
* [Doc > Quickstart] Should we add links to guides for better discoverability? by kaisopos in https://github.com/oumi-ai/oumi/pull/1076
* Add e2e tests for running tutorial notebooks by oelachqar in https://github.com/oumi-ai/oumi/pull/1079
* Ignore all experimental files when running our circular dependency test. by taenin in https://github.com/oumi-ai/oumi/pull/1081
* [Super Nit Doc Update] environments.md by kaisopos in https://github.com/oumi-ai/oumi/pull/1082
* Add an env var for loading user registered values (dataset, models, clouds) when initializing the Oumi Registry by taenin in https://github.com/oumi-ai/oumi/pull/1077
* Update internal model configs to support default `tokenizer_pad_token` and `chat_template` by model type by xrdaukar in https://github.com/oumi-ai/oumi/pull/1078
* [Minor] Notebook typo by kaisopos in https://github.com/oumi-ai/oumi/pull/1085
* Upgrade transformers to 4.47 by wizeng23 in https://github.com/oumi-ai/oumi/pull/1033
* [tiny][docs] Update recipes page by wizeng23 in https://github.com/oumi-ai/oumi/pull/1072
* Configure e2e integration test for Llama 3.2 Vision 11B by xrdaukar in https://github.com/oumi-ai/oumi/pull/1086
* Nits for cli_reference.md by mkoukoumidis in https://github.com/oumi-ai/oumi/pull/1063
* [Documentation] Evaluate | Leaderboards Page by kaisopos in https://github.com/oumi-ai/oumi/pull/1084
* [Documentation] Evaluate | Main Page (revision) by kaisopos in https://github.com/oumi-ai/oumi/pull/1089
* [tiny] Fix precommit by oelachqar in https://github.com/oumi-ai/oumi/pull/1092
* Add timeout for unit & integration tests by oelachqar in https://github.com/oumi-ai/oumi/pull/1091
* Add GitHub Actions workflow for doctests by oelachqar in https://github.com/oumi-ai/oumi/pull/1093
* [docs] remove unused page, fix links by oelachqar in https://github.com/oumi-ai/oumi/pull/1094
* [Documentation] Evaluate | Main Page (small refactor) by kaisopos in https://github.com/oumi-ai/oumi/pull/1095
* Rewrite of the main Oumi Launch page. by taenin in https://github.com/oumi-ai/oumi/pull/1087
* Remove `pytest.mark.skip()` for basic e2e tests by xrdaukar in https://github.com/oumi-ai/oumi/pull/1088
* [tiny] Upgrade minimum numpy version to unblock python3.12 installation by oelachqar in https://github.com/oumi-ai/oumi/pull/1099
* Update our Readme with a new header image. by taenin in https://github.com/oumi-ai/oumi/pull/1098
* [docs] Minor refresh to dataset resource pages by oelachqar in https://github.com/oumi-ai/oumi/pull/1097
* [docs] Add docs guide page by oelachqar in https://github.com/oumi-ai/oumi/pull/1096
* Add a quick unit test to ensure new dependencies are not added to the top-level CLI by taenin in https://github.com/oumi-ai/oumi/pull/1100
* Updates distributed run to handle local machine scenarios by jgreer013 in https://github.com/oumi-ai/oumi/pull/1028
* [docs] Refresh datasets resource page by oelachqar in https://github.com/oumi-ai/oumi/pull/1101
* Update e2e integration tests to allow `trainer_type` overrides by xrdaukar in https://github.com/oumi-ai/oumi/pull/1090
* Temporary fix for "processing_class" error for OUMI trainer by xrdaukar in https://github.com/oumi-ai/oumi/pull/1102
* Update training logging by xrdaukar in https://github.com/oumi-ai/oumi/pull/1104
* Vision-Language LoRA training by optas in https://github.com/oumi-ai/oumi/pull/1027
* Update the Deploy page to use CLI and python instructions. by taenin in https://github.com/oumi-ai/oumi/pull/1106
* Set `rank/world_size/device_id=` params in `init_process_group()` to resolve warnings by xrdaukar in https://github.com/oumi-ai/oumi/pull/1109
* [Revert] Undo package upgrades including transformers 4.47 by wizeng23 in https://github.com/oumi-ai/oumi/pull/1111
* Move Llama 3.2 Vision 11b SFT FFT configs into subdir by xrdaukar in https://github.com/oumi-ai/oumi/pull/1110
* [docs] Update dataset formats page by oelachqar in https://github.com/oumi-ai/oumi/pull/1112
* [docs] Remove placeholder pages by oelachqar in https://github.com/oumi-ai/oumi/pull/1113
* [docs] address misc feedback from the team by oelachqar in https://github.com/oumi-ai/oumi/pull/1114
* Update quickstart.md by penfever in https://github.com/oumi-ai/oumi/pull/1115
* Minor updates to `oumi env` by xrdaukar in https://github.com/oumi-ai/oumi/pull/1116
* Update `oumi train` path to set `TOKENIZERS_PARALLELISM` variable by xrdaukar in https://github.com/oumi-ai/oumi/pull/1117
* Reorganize oumi distributed run logic by oelachqar in https://github.com/oumi-ai/oumi/pull/1118
* [docs] Update misc Multimodal docs by xrdaukar in https://github.com/oumi-ai/oumi/pull/1120
* [docs] Add badges, smaller logos, and nit fix to notebook by oelachqar in https://github.com/oumi-ai/oumi/pull/1121
* Rename LLAMA 8b and 11B Vision inference `REMOTE_VLLM` configs by xrdaukar in https://github.com/oumi-ai/oumi/pull/1122
* Fix a circular dependency when invoking the oumi src directory via Python. by taenin in https://github.com/oumi-ai/oumi/pull/1119
* [docs] Multimodal SFT datasets: minor updates by xrdaukar in https://github.com/oumi-ai/oumi/pull/1123
* Fix all doctests by wizeng23 in https://github.com/oumi-ai/oumi/pull/1107
* Cleanup legacy constraints on datasets by oelachqar in https://github.com/oumi-ai/oumi/pull/1080
* [tiny] skip alpaca_eval test if not installed by oelachqar in https://github.com/oumi-ai/oumi/pull/1125
* Remove `target_col: ` from VLM configs by xrdaukar in https://github.com/oumi-ai/oumi/pull/1126
* Update our documentation for custom clusters by taenin in https://github.com/oumi-ai/oumi/pull/1128
* Mark target_col and use_async_dataset as deprecated by oelachqar in https://github.com/oumi-ai/oumi/pull/1127
* Update sample VLM eval config by xrdaukar in https://github.com/oumi-ai/oumi/pull/1129
* [Documentation] Evaluate | Standardized Benchmarks page (refactor) by kaisopos in https://github.com/oumi-ai/oumi/pull/1131
* [docs] Add doc section for MMMU benchmark by xrdaukar in https://github.com/oumi-ai/oumi/pull/1130
* Evaluation yaml configs: standardize task name by kaisopos in https://github.com/oumi-ai/oumi/pull/1108
* [doc] Oumi Models (part-1) by optas in https://github.com/oumi-ai/oumi/pull/1083
* Enable FSDP for Llama Vision with OUMI trainer by xrdaukar in https://github.com/oumi-ai/oumi/pull/1105
* [docs] Add an entry for "Fine-tuning a Vision-Language Model" by xrdaukar in https://github.com/oumi-ai/oumi/pull/1132
* [tiny] [docs] troubleshooting improvements by optas in https://github.com/oumi-ai/oumi/pull/1070
* [tiny] [docs] Cleanup touches on Quickstart by optas in https://github.com/oumi-ai/oumi/pull/1133
* VLM e2e integration tests update by xrdaukar in https://github.com/oumi-ai/oumi/pull/1138
* Reduce batch size in Qwen2 VL eval config to reduce chances of OOM by xrdaukar in https://github.com/oumi-ai/oumi/pull/1136
* Update readme, first iteration by oelachqar in https://github.com/oumi-ai/oumi/pull/1135
* Remove BLIP2 configs temporarily by xrdaukar in https://github.com/oumi-ai/oumi/pull/1140
* [tiny] Looser pandas version requirements by oelachqar in https://github.com/oumi-ai/oumi/pull/1141
* Add a Customizing Oumi page to our docs by taenin in https://github.com/oumi-ai/oumi/pull/1142
* Upgrade `wandb` version by xrdaukar in https://github.com/oumi-ai/oumi/pull/1143
* Get all notebooks to run by wizeng23 in https://github.com/oumi-ai/oumi/pull/1139
* Add fineweb e2e integration test by xrdaukar in https://github.com/oumi-ai/oumi/pull/1146
* Update quickstart.md with emoji by michaellema in https://github.com/oumi-ai/oumi/pull/1147
* Remove some notebook E2E tests by wizeng23 in https://github.com/oumi-ai/oumi/pull/1145
* Updated notebooks and quickstart documentation by penfever in https://github.com/oumi-ai/oumi/pull/1134
* updated quickstart with clearer instructions about setting up cloud by penfever in https://github.com/oumi-ai/oumi/pull/1151
* add max_memory_per_gpu and device_map params to lm eval harness by penfever in https://github.com/oumi-ai/oumi/pull/1150
* Add generic prompt-response dataset and update finetuning notebook by jgreer013 in https://github.com/oumi-ai/oumi/pull/1144
* Review default values by wizeng23 in https://github.com/oumi-ai/oumi/pull/1124

New Contributors
* penfever made their first contribution in https://github.com/oumi-ai/oumi/pull/1115
* michaellema made their first contribution in https://github.com/oumi-ai/oumi/pull/1147

**Full Changelog**: https://github.com/oumi-ai/oumi/compare/v0.1.1-alpha...v0.1.2-alpha

0.1.1alpha

What's Changed
* Minimal SkyPilot config for `blip2` and `llava` models for GCP with `TRL_SFT` by xrdaukar in https://github.com/oumi-ai/oumi/pull/573
* Inference Engine async writes by taenin in https://github.com/oumi-ai/oumi/pull/574
* Misc cleanups in `JsonlinesDataset` by xrdaukar in https://github.com/oumi-ai/oumi/pull/576
* Split out cloud dependencies by taenin in https://github.com/oumi-ai/oumi/pull/575
* Disable `sdpa` for `blip2` by xrdaukar in https://github.com/oumi-ai/oumi/pull/579
* Set accelerate version to fix FSDP model saving by wizeng23 in https://github.com/oumi-ai/oumi/pull/580
* Remove AWS as a default dep by taenin in https://github.com/oumi-ai/oumi/pull/582
* Update `ProfilerParams` docstrings to follow the new style by xrdaukar in https://github.com/oumi-ai/oumi/pull/583
* Minor updates in `scripts/benchmarks/minimal_multimodal_training.py` by xrdaukar in https://github.com/oumi-ai/oumi/pull/585
* Add `override` annotations to methods of few Dataset subclasses by xrdaukar in https://github.com/oumi-ai/oumi/pull/584
* Add dataset class for dolly dataset by oelachqar in https://github.com/oumi-ai/oumi/pull/586
* Refactor debugging/device utils, and add new GPU stats measurement functions by xrdaukar in https://github.com/oumi-ai/oumi/pull/587
* Add text jsonlines dataset class by oelachqar in https://github.com/oumi-ai/oumi/pull/589
* Define `DataCollationParams` by xrdaukar in https://github.com/oumi-ai/oumi/pull/581
* Misc updates to Polaris launcher scripts by xrdaukar in https://github.com/oumi-ai/oumi/pull/591
* Set up a new version of the Oumi CLI using Typer by taenin in https://github.com/oumi-ai/oumi/pull/588
* Update handling of GPU fan speed info by xrdaukar in https://github.com/oumi-ai/oumi/pull/595
* Add support for magpie dataset variants by oelachqar in https://github.com/oumi-ai/oumi/pull/594
* Rename GenerationConfig to GenerationParams by wizeng23 in https://github.com/oumi-ai/oumi/pull/592
* Fix cli infer test by wizeng23 in https://github.com/oumi-ai/oumi/pull/598
* Judge Notebook 1: default judge by kaisopos in https://github.com/oumi-ai/oumi/pull/593
* [Tiny] update missing dataset import by oelachqar in https://github.com/oumi-ai/oumi/pull/599
* Update training script to support data collators by xrdaukar in https://github.com/oumi-ai/oumi/pull/590
* Update accelerate version to 1.0.0 by wizeng23 in https://github.com/oumi-ai/oumi/pull/601
* Remove deprecated dataset code paths by oelachqar in https://github.com/oumi-ai/oumi/pull/596
* Refactor Aya & Ultrachat to use oumi dataset sft classes by oelachqar in https://github.com/oumi-ai/oumi/pull/597
* Add Llama train/eval/infer E2E integration test by wizeng23 in https://github.com/oumi-ai/oumi/pull/602
* Set docstring for `NVidiaGpuRuntimeInfo` struct by xrdaukar in https://github.com/oumi-ai/oumi/pull/603
* Add generation params to inference engines by oelachqar in https://github.com/oumi-ai/oumi/pull/600
* [bug] Fix issue loading jsonl datasets from file by oelachqar in https://github.com/oumi-ai/oumi/pull/604
* Add Llama 3B configs by wizeng23 in https://github.com/oumi-ai/oumi/pull/605
* Align pyright checks with latest Pylance version by oelachqar in https://github.com/oumi-ai/oumi/pull/611
* Fix `apply_chat_template` issue in `VisionLanguageSftDataset` by xrdaukar in https://github.com/oumi-ai/oumi/pull/609
* More robust make setup by oelachqar in https://github.com/oumi-ai/oumi/pull/610
* Fix a bug where the new CLI was improperly importing functions from top-level modules. by taenin in https://github.com/oumi-ai/oumi/pull/613
* Add support for the Launch command suite in the new CLI by taenin in https://github.com/oumi-ai/oumi/pull/612
* Support `HuggingFaceH4/llava-instruct-mix-vsft` dataset by xrdaukar in https://github.com/oumi-ai/oumi/pull/608
* [tiny] Fix .gitignore by wizeng23 in https://github.com/oumi-ai/oumi/pull/616
* [tiny] add gpt2 chat template, and update tests to use it by oelachqar in https://github.com/oumi-ai/oumi/pull/617
* Turn off pretty-printing exceptions in our CLI by taenin in https://github.com/oumi-ai/oumi/pull/618
* Cleanup dependencies by oelachqar in https://github.com/oumi-ai/oumi/pull/615
* Upgrade oumi dependencies by oelachqar in https://github.com/oumi-ai/oumi/pull/606
* Update makefile to use uv, add Jupyter target by oelachqar in https://github.com/oumi-ai/oumi/pull/614
* Add miniconda installation target, cleanup unused make commands by oelachqar in https://github.com/oumi-ai/oumi/pull/620
* Update several notebooks with the new EvaluationConfig format. by taenin in https://github.com/oumi-ai/oumi/pull/621
* Make sure conda env is registered by oelachqar in https://github.com/oumi-ai/oumi/pull/622
* Add Llama 3b sft/lora/qlora configs for Polaris by wizeng23 in https://github.com/oumi-ai/oumi/pull/626
* Add check if installation is successful by oelachqar in https://github.com/oumi-ai/oumi/pull/625
* Initial Cambrian integration by xrdaukar in https://github.com/oumi-ai/oumi/pull/557
* [tiny] alpaca - minor reproducibility boost by optas in https://github.com/oumi-ai/oumi/pull/619
* explicitly specify the model's dtype in LMH by optas in https://github.com/oumi-ai/oumi/pull/607
* [tiny] Add flops for T4 GPU by wizeng23 in https://github.com/oumi-ai/oumi/pull/628
* Use a timestamp for job directories on Polaris by taenin in https://github.com/oumi-ai/oumi/pull/627
* [tiny] Fix bug with Polaris job num by wizeng23 in https://github.com/oumi-ai/oumi/pull/629
* Update two VLLM configs. by xrdaukar in https://github.com/oumi-ai/oumi/pull/624
* Add `pip install -U uv;` to `make setup` for existing envs by xrdaukar in https://github.com/oumi-ai/oumi/pull/630
* Disable MFU logging for non-packed datasets by wizeng23 in https://github.com/oumi-ai/oumi/pull/632
* Add config example for long context fine-tuning by oelachqar in https://github.com/oumi-ai/oumi/pull/631
* Add distribution mode flag to llama_tune by wizeng23 in https://github.com/oumi-ai/oumi/pull/635
* Judge Notebook 2: Custom Judge by kaisopos in https://github.com/oumi-ai/oumi/pull/623
* Bugfixes for LLAVA by xrdaukar in https://github.com/oumi-ai/oumi/pull/634
* Update sphinx config and docs to fix misc errors and warnings by oelachqar in https://github.com/oumi-ai/oumi/pull/639
* Factor out OUMI_TOTAL_NUM_GPUS env var by wizeng23 in https://github.com/oumi-ai/oumi/pull/636
* Remove bitsandbytes from train dependencies by oelachqar in https://github.com/oumi-ai/oumi/pull/643
* Enable intersphinx to allow linking to external documentation pages by oelachqar in https://github.com/oumi-ai/oumi/pull/640
* Tune few training params for LLAVA and blip2 models by xrdaukar in https://github.com/oumi-ai/oumi/pull/642
* Added support for specifying the inference engine via the InferenceConfig by taenin in https://github.com/oumi-ai/oumi/pull/638
* Add popular pre-training dataset classes by oelachqar in https://github.com/oumi-ai/oumi/pull/641
* Remove openai dependency by oelachqar in https://github.com/oumi-ai/oumi/pull/644
* Update our documentation to point to the new CLI. by taenin in https://github.com/oumi-ai/oumi/pull/645
* Enable dataloaders for VLLM-s (llava and blip2) by xrdaukar in https://github.com/oumi-ai/oumi/pull/646
* Allow gradient clipping to be optional by optas in https://github.com/oumi-ai/oumi/pull/649
* Add support for `add_generation_prompt` in LLAVA chat template by xrdaukar in https://github.com/oumi-ai/oumi/pull/648
* Add a description to the Launch CLI by taenin in https://github.com/oumi-ai/oumi/pull/651
* Add all Llama FSDP GCP configs by wizeng23 in https://github.com/oumi-ai/oumi/pull/637
* Coerce model params to correct dtype for QLoRA FSDP by wizeng23 in https://github.com/oumi-ai/oumi/pull/652
* Use uv for `pip install` commands by wizeng23 in https://github.com/oumi-ai/oumi/pull/653
* Update sphinx docs by oelachqar in https://github.com/oumi-ai/oumi/pull/654
* [Docs] Refactor docs pipeline by oelachqar in https://github.com/oumi-ai/oumi/pull/655
* [docs] swap and configure sphinx theme by oelachqar in https://github.com/oumi-ai/oumi/pull/656
* [Docs] Add documentation placeholders by oelachqar in https://github.com/oumi-ai/oumi/pull/658
* [Docs] Add sphinx-bibtex by oelachqar in https://github.com/oumi-ai/oumi/pull/659
* [Docs] fix rendering issues by oelachqar in https://github.com/oumi-ai/oumi/pull/660
* [docs] fix broken links by oelachqar in https://github.com/oumi-ai/oumi/pull/661
* Fix broken link in readme (dev_setup) by kaisopos in https://github.com/oumi-ai/oumi/pull/662
* [docs][tiny] fix minor doc typos by oelachqar in https://github.com/oumi-ai/oumi/pull/666
* [docs] add autodoc2 template by oelachqar in https://github.com/oumi-ai/oumi/pull/665
* [docs] Add content links and references by oelachqar in https://github.com/oumi-ai/oumi/pull/668
* [docs] switch to myst-nb for rendering notebooks by oelachqar in https://github.com/oumi-ai/oumi/pull/669
* [docs] Add script to generate module summaries by oelachqar in https://github.com/oumi-ai/oumi/pull/670
* [docs] Include cli reference by oelachqar in https://github.com/oumi-ai/oumi/pull/671
* Add dataset submodules by oelachqar in https://github.com/oumi-ai/oumi/pull/667
* Update notebooks to include a descriptive title by oelachqar in https://github.com/oumi-ai/oumi/pull/664
* Update tests/utils/test_device_utils.py by xrdaukar in https://github.com/oumi-ai/oumi/pull/672
* [Inference] Bug in generation config stop tokens by kaisopos in https://github.com/oumi-ai/oumi/pull/663
* Support rewriting special label values to -100 (`ignore_index`) to exclude from loss by xrdaukar in https://github.com/oumi-ai/oumi/pull/657
* Rename emails and website url to Oumi by wizeng23 in https://github.com/oumi-ai/oumi/pull/675
* Update scripts/benchmarks/minimal_multimodal_training.py to be able to test 3 other models by xrdaukar in https://github.com/oumi-ai/oumi/pull/677
* Update data collators TextCollatorWithPadding by xrdaukar in https://github.com/oumi-ai/oumi/pull/674
* Add image support to `llama3-instruct.jinja` chat template by xrdaukar in https://github.com/oumi-ai/oumi/pull/676
* Cleanup vscode launch.json by oelachqar in https://github.com/oumi-ai/oumi/pull/679
* Add Llama 3B DDP configs by wizeng23 in https://github.com/oumi-ai/oumi/pull/681
* Add cff file for citations by oelachqar in https://github.com/oumi-ai/oumi/pull/682
* Small edits to Llama configs by wizeng23 in https://github.com/oumi-ai/oumi/pull/683
* Increase `llava.7b.sft` max length from 512 to 1024 by xrdaukar in https://github.com/oumi-ai/oumi/pull/687
* Rename turn.py to conversation.py by oelachqar in https://github.com/oumi-ai/oumi/pull/685
* Pad token fix by kaisopos in https://github.com/oumi-ai/oumi/pull/678
* Add basic validation for data collators by xrdaukar in https://github.com/oumi-ai/oumi/pull/688
* Support `--image` CLI param in `infer_interactive` by xrdaukar in https://github.com/oumi-ai/oumi/pull/686
* Move sft and preference tuning datasets to their own module by oelachqar in https://github.com/oumi-ai/oumi/pull/691
* Set `model.tokenizer_pad_token` for gpt2 in Oumi Inference example in `vsconfig` by xrdaukar in https://github.com/oumi-ai/oumi/pull/692
* Update our dev set up instructions to not conflict with the new CLI by taenin in https://github.com/oumi-ai/oumi/pull/698
* Add back the polling spinner to the Launch CLI by taenin in https://github.com/oumi-ai/oumi/pull/695
* Add a dummy sft dataset for unit tests by oelachqar in https://github.com/oumi-ai/oumi/pull/697
* Include judge commands into main CLI by oelachqar in https://github.com/oumi-ai/oumi/pull/690
* Replace `logger.exception()` in `TextCollatorWithPadding` with `logger.error()` by xrdaukar in https://github.com/oumi-ai/oumi/pull/700
* [Clean-up] Remove dead code (custom eval) by kaisopos in https://github.com/oumi-ai/oumi/pull/696
* Define `oumi.core.constants` file for global constants by xrdaukar in https://github.com/oumi-ai/oumi/pull/694
* Add serialization helper methods for Conversation object by oelachqar in https://github.com/oumi-ai/oumi/pull/701
* Create pull_request_template.md by taenin in https://github.com/oumi-ai/oumi/pull/693
* Use `-100` as `ignore_index` in `VisionLanguageSftDataset` and in `build_data_collator()` by default by xrdaukar in https://github.com/oumi-ai/oumi/pull/702
* Add main entrypoint, update vscode launch.json to use new cli by oelachqar in https://github.com/oumi-ai/oumi/pull/704
* Update experimental async dataset feature by oelachqar in https://github.com/oumi-ai/oumi/pull/705
* Update new image CLI to support input image for VLLMs by xrdaukar in https://github.com/oumi-ai/oumi/pull/699
* Update sft jsonl dataset to support json files and alpaca format by oelachqar in https://github.com/oumi-ai/oumi/pull/703
* Make interactive inference continually loop by wizeng23 in https://github.com/oumi-ai/oumi/pull/689
* Update vscode launch inference example by xrdaukar in https://github.com/oumi-ai/oumi/pull/706
* Training configurations llama3.1 + ChatML-Template by optas in https://github.com/oumi-ai/oumi/pull/684
* Add help strings for `oumi train/evaluate/infer` by wizeng23 in https://github.com/oumi-ai/oumi/pull/708
* Log peak gpu memory at the end of training by oelachqar in https://github.com/oumi-ai/oumi/pull/710
* Log training git commit and tag by oelachqar in https://github.com/oumi-ai/oumi/pull/709
* Add github issue templates. by taenin in https://github.com/oumi-ai/oumi/pull/711
* Enable more ruff rules by oelachqar in https://github.com/oumi-ai/oumi/pull/712
* Add Llama inference configs by wizeng23 in https://github.com/oumi-ai/oumi/pull/713
* Enable pathlib rule by oelachqar in https://github.com/oumi-ai/oumi/pull/714
* Update index.rst: replace the expanded name "Open Universal Machine Intelligence" with "Oumi" by ciaralema in https://github.com/oumi-ai/oumi/pull/715
* Make sure registry is case agnostic by oelachqar in https://github.com/oumi-ai/oumi/pull/717
* Move config files to new directory structure by wizeng23 in https://github.com/oumi-ai/oumi/pull/720
* Replace `np.array(x)` with `x.numpy()` to resolve test warnings by xrdaukar in https://github.com/oumi-ai/oumi/pull/724
* Update index.rst by ciaralema in https://github.com/oumi-ai/oumi/pull/719
* Add judge cli tests and bugfixes by oelachqar in https://github.com/oumi-ai/oumi/pull/722
* Update `tests/builders/test_lr_schedules.py` by xrdaukar in https://github.com/oumi-ai/oumi/pull/723
* Add dataset_path parameter by oelachqar in https://github.com/oumi-ai/oumi/pull/716
* [tiny] Update judge configs by oelachqar in https://github.com/oumi-ai/oumi/pull/725
* Rename `--generation.(input/output)_filepath` flags to `--(input/output)_path` and rename `detach` flag by wizeng23 in https://github.com/oumi-ai/oumi/pull/707
* Rename LLAVA sft_sky_job.yaml for consistency with blip2 by xrdaukar in https://github.com/oumi-ai/oumi/pull/729
* Remove deprecated preprocessing function by oelachqar in https://github.com/oumi-ai/oumi/pull/726
* Add DPO base class, initial dataset, integration tests by oelachqar in https://github.com/oumi-ai/oumi/pull/728
* [tiny] Add datasets import to enable registry discovery by oelachqar in https://github.com/oumi-ai/oumi/pull/731
* Add a helper function `is_image_text_llm()` to detect supported image+text LLMs by xrdaukar in https://github.com/oumi-ai/oumi/pull/730
* Add dpo debug dataset and integration test by oelachqar in https://github.com/oumi-ai/oumi/pull/732
* Add inference engine for vertex GCP by oelachqar in https://github.com/oumi-ai/oumi/pull/734
* [docs] Cleanup placeholder documentation pages by oelachqar in https://github.com/oumi-ai/oumi/pull/736
* Native inference with images (`model.generate`), and Processor abstraction by xrdaukar in https://github.com/oumi-ai/oumi/pull/721
* [docs] update faq section by oelachqar in https://github.com/oumi-ai/oumi/pull/739
* [docs] update advanced topics section by oelachqar in https://github.com/oumi-ai/oumi/pull/737
* [docs] update datasets section by oelachqar in https://github.com/oumi-ai/oumi/pull/738
* [docs] Update inference section by oelachqar in https://github.com/oumi-ai/oumi/pull/744
* [docs] Add evaluate and judge user guides by oelachqar in https://github.com/oumi-ai/oumi/pull/745
* [docs] Update training section by oelachqar in https://github.com/oumi-ai/oumi/pull/741
* Support `merve/vqav2-small` dataset by xrdaukar in https://github.com/oumi-ai/oumi/pull/742
* Reorder IMAGE and TEXT messages in coco and Flickr datasets by xrdaukar in https://github.com/oumi-ai/oumi/pull/743
* Rename base datasets modules by oelachqar in https://github.com/oumi-ai/oumi/pull/735
* [tests] Move unit tests to unit folder by oelachqar in https://github.com/oumi-ai/oumi/pull/753
* Update training params for LLAVA and BLIP2 models by xrdaukar in https://github.com/oumi-ai/oumi/pull/754
* [docs] Add script to summarize configs by oelachqar in https://github.com/oumi-ai/oumi/pull/751
* [docs] Enable termynal cli and add example by oelachqar in https://github.com/oumi-ai/oumi/pull/750
* [docs] Update apidoc rst files with latest modules by oelachqar in https://github.com/oumi-ai/oumi/pull/749
* [docs] Add logo, analytics, and update sphinx config by oelachqar in https://github.com/oumi-ai/oumi/pull/747
* [docs] Update about section by oelachqar in https://github.com/oumi-ai/oumi/pull/748
* [docs] Add page for using skypilot by oelachqar in https://github.com/oumi-ai/oumi/pull/746
* Update base dataset class to use dataset_path by oelachqar in https://github.com/oumi-ai/oumi/pull/757
* Unit tests with sanity checks for `oumi` chat templates by xrdaukar in https://github.com/oumi-ai/oumi/pull/755
* [docs] Misc docs cleanup tasks by oelachqar in https://github.com/oumi-ai/oumi/pull/756
* Add WandB integration to eval by wizeng23 in https://github.com/oumi-ai/oumi/pull/727
* Add 3 new PNG images to testdata by xrdaukar in https://github.com/oumi-ai/oumi/pull/758
* Update LLAVA integration tests to use different test images by xrdaukar in https://github.com/oumi-ai/oumi/pull/761
* [tests] Add shared mock tokenizer fixture, requires_gpus marker by oelachqar in https://github.com/oumi-ai/oumi/pull/759
* Update vllm engine to allow for continuous batching by oelachqar in https://github.com/oumi-ai/oumi/pull/762
* Added text completions collator hard-coded for Llama 3.1 support by jgreer013 in https://github.com/oumi-ai/oumi/pull/752
* Update remote inference engines to show progress, reduce native engine verbosity by oelachqar in https://github.com/oumi-ai/oumi/pull/760
* Add sample oumi launcher configs for LLAVA and blip2 by xrdaukar in https://github.com/oumi-ai/oumi/pull/763
* Create 2 other Launcher configs for LLAVA and blip2 using OUMI training loop by xrdaukar in https://github.com/oumi-ai/oumi/pull/765
* Update CI to include all dependencies, fix newly introduced vllm bugs by oelachqar in https://github.com/oumi-ai/oumi/pull/764
* Suggest using the git root directory when launching jobs via a CLI in dev-build. by taenin in https://github.com/oumi-ai/oumi/pull/766
* [bug] enable logger propagation so that pytest can capture them by oelachqar in https://github.com/oumi-ai/oumi/pull/769
* Update liger kernel integration by oelachqar in https://github.com/oumi-ai/oumi/pull/768
* Add BaseInferenceEngine supported_params to keep track of inference engine parameters by oelachqar in https://github.com/oumi-ai/oumi/pull/770
* Add READMEs for new config structure by wizeng23 in https://github.com/oumi-ai/oumi/pull/767
* Fix `*_no_cuda()` tests in `tests/unit/utils/test_device_utils.py` by xrdaukar in https://github.com/oumi-ai/oumi/pull/775
* Remove support for python 3.8, update to 3.9 by oelachqar in https://github.com/oumi-ai/oumi/pull/772
* Multimodal improvements, and basic support for `meta-llama/Llama-3.2-11B-Vision-Instruct` by xrdaukar in https://github.com/oumi-ai/oumi/pull/771
* Set vLLM max LoRA rank based on LoRA adapter by wizeng23 in https://github.com/oumi-ai/oumi/pull/776
* [WIP] Llama 3.2 Vision 11B configs by xrdaukar in https://github.com/oumi-ai/oumi/pull/777
* Tune MLLMs configs for Llama Vision and LLAVA by xrdaukar in https://github.com/oumi-ai/oumi/pull/779
* Inference vLLM: towards fitting big models in GPU memory by optas in https://github.com/oumi-ai/oumi/pull/774
* Adds control for an explicit quantization-compute dtype by optas in https://github.com/oumi-ai/oumi/pull/680
* Correct layer name in Llama vision accelerate config by xrdaukar in https://github.com/oumi-ai/oumi/pull/782
* Clean up configs by wizeng23 in https://github.com/oumi-ai/oumi/pull/781
* Save (pre-)processor configs by xrdaukar in https://github.com/oumi-ai/oumi/pull/780
* Tiny: force batch_size of generation parameters >= 1 by optas in https://github.com/oumi-ai/oumi/pull/783
* Switch to the regular `pip` temporarily in the old Polaris launcher by xrdaukar in https://github.com/oumi-ai/oumi/pull/787
* Define PAD token for Llama 2B model (fineweb pretrain) by xrdaukar in https://github.com/oumi-ai/oumi/pull/785
* Rename `oumi launch stop` to `oumi launch cancel` by wizeng23 in https://github.com/oumi-ai/oumi/pull/786
* [tiny] move collator test to unit folder by oelachqar in https://github.com/oumi-ai/oumi/pull/790
* Add `oumi launch stop` subcommand by wizeng23 in https://github.com/oumi-ai/oumi/pull/789
* Add sft packed dataset and tests by oelachqar in https://github.com/oumi-ai/oumi/pull/788
* CLI documentation updates. by taenin in https://github.com/oumi-ai/oumi/pull/791
* Updated default temperature from 1.0 to 0.0 for generation params by jgreer013 in https://github.com/oumi-ai/oumi/pull/792
* Add a prominent link to our docs. by taenin in https://github.com/oumi-ai/oumi/pull/793
* Fixing 2 broken links in README.md by kaisopos in https://github.com/oumi-ai/oumi/pull/794
* Switch from writing individual conversations to all conversations by jgreer013 in https://github.com/oumi-ai/oumi/pull/796
* Update pull_request_template.md by taenin in https://github.com/oumi-ai/oumi/pull/798
* Fix polling jobs and job status in the CLI by taenin in https://github.com/oumi-ai/oumi/pull/795
* Update base SFT dataset to add support for training on assistant turns only by oelachqar in https://github.com/oumi-ai/oumi/pull/797
* Small fix for building docs by taenin in https://github.com/oumi-ai/oumi/pull/799
* Update polaris user regex to handle usernames with dashes by jgreer013 in https://github.com/oumi-ai/oumi/pull/801
* Simplify `TextCollatorWithPadding` by xrdaukar in https://github.com/oumi-ai/oumi/pull/800
* Minor updates to docstrings and error messages related to `text_with_padding` collator by xrdaukar in https://github.com/oumi-ai/oumi/pull/802
* Adding more control/parameters for text generation by optas in https://github.com/oumi-ai/oumi/pull/784
* Pre-training HSDP configs for Llama 8B for A100-40GB by xrdaukar in https://github.com/oumi-ai/oumi/pull/778
* Fix pyproject.toml merge error by oelachqar in https://github.com/oumi-ai/oumi/pull/805
* Update llama_tune script to support 8B pretraining by xrdaukar in https://github.com/oumi-ai/oumi/pull/804
* Add e2e fine-tuning test by oelachqar in https://github.com/oumi-ai/oumi/pull/806
* Create Polaris and GCP launcher configs for Llama 8B pretraining by xrdaukar in https://github.com/oumi-ai/oumi/pull/807
* Enable sphinx-tabs, update OOM faq page to use it by oelachqar in https://github.com/oumi-ai/oumi/pull/808
* [docs] Start adding doctests by oelachqar in https://github.com/oumi-ai/oumi/pull/809
* Fixed broken skyssh and skycode commands by jgreer013 in https://github.com/oumi-ai/oumi/pull/810
* Enable Accelerate FSDP runs without using the Accelerate launcher by wizeng23 in https://github.com/oumi-ai/oumi/pull/803
* Disable uv in polaris launcher by xrdaukar in https://github.com/oumi-ai/oumi/pull/812
* Migrate configs to use Oumi instead of Accelerate configs by wizeng23 in https://github.com/oumi-ai/oumi/pull/811
* Update pretest.yaml by taenin in https://github.com/oumi-ai/oumi/pull/817
* Update installation.md by taenin in https://github.com/oumi-ai/oumi/pull/814
* Add smollm config by oelachqar in https://github.com/oumi-ai/oumi/pull/815
* Add extra logging in Polaris client by xrdaukar in https://github.com/oumi-ai/oumi/pull/813
* Fix a small typo in quickstart by taenin in https://github.com/oumi-ai/oumi/pull/818
* Correct names of few env vars in `llama_tune.sh` by xrdaukar in https://github.com/oumi-ai/oumi/pull/819
* [tiny] move collator to builders by oelachqar in https://github.com/oumi-ai/oumi/pull/821
* Update sample commands to run Llama 8B model on Polaris by xrdaukar in https://github.com/oumi-ai/oumi/pull/823
* [tiny] update device cleanup by oelachqar in https://github.com/oumi-ai/oumi/pull/822
* Update quickstart to use the new CLI arg format. by taenin in https://github.com/oumi-ai/oumi/pull/820
* Mock out slow tests. by taenin in https://github.com/oumi-ai/oumi/pull/825
* More changes for Llama 8B Polaris by xrdaukar in https://github.com/oumi-ai/oumi/pull/826
* Minor updates to Polaris scripts by xrdaukar in https://github.com/oumi-ai/oumi/pull/827
* Misc documentation updates for installation, quickstart, and tutorials. by taenin in https://github.com/oumi-ai/oumi/pull/828
* Add image utils to load image bytes for messages with `IMAGE_URL`, `IMAGE_PATH` types by xrdaukar in https://github.com/oumi-ai/oumi/pull/829
* Update our documentation to include modules for directories that aren't packages. by taenin in https://github.com/oumi-ai/oumi/pull/830
* Add Llama405B Qlora/Lora/FFT configs by wizeng23 in https://github.com/oumi-ai/oumi/pull/833
* Add `SGLangInferenceEngine` by xrdaukar in https://github.com/oumi-ai/oumi/pull/834
* Switch from 8-bit AdamW to AdamW for FSDP Lora/QLora by wizeng23 in https://github.com/oumi-ai/oumi/pull/835
* Add options for specifying log levels in the CLI. by taenin in https://github.com/oumi-ai/oumi/pull/836
* Request user feedback after training. by taenin in https://github.com/oumi-ai/oumi/pull/837
* Documentation fix for training. by taenin in https://github.com/oumi-ai/oumi/pull/840
* [tiny] pass dataset_path to constructor by oelachqar in https://github.com/oumi-ai/oumi/pull/841
* Move chatqa to projects, add details by oelachqar in https://github.com/oumi-ai/oumi/pull/816
* [tiny] fix typo in docs/quickstart by oelachqar in https://github.com/oumi-ai/oumi/pull/843
* Add unit tests for builders/data module by oelachqar in https://github.com/oumi-ai/oumi/pull/842
* Additional improvements to 405B by wizeng23 in https://github.com/oumi-ai/oumi/pull/838
* Add constrained decoding to vllm engine by oelachqar in https://github.com/oumi-ai/oumi/pull/844
* Add support for remote vllm inference engine by oelachqar in https://github.com/oumi-ai/oumi/pull/845
* Add `override` annotations to misc inference engines methods where missing by xrdaukar in https://github.com/oumi-ai/oumi/pull/846
* Add a debug command for dumping environment information. by taenin in https://github.com/oumi-ai/oumi/pull/839
* [tiny] fix openai remote engine auth bug by oelachqar in https://github.com/oumi-ai/oumi/pull/848
* Update RemoteInferenceEngine to support guided decoding by oelachqar in https://github.com/oumi-ai/oumi/pull/849
* Support multimodal inference with remote engines by xrdaukar in https://github.com/oumi-ai/oumi/pull/847
* Fixes for quickstart and launch commands. by taenin in https://github.com/oumi-ai/oumi/pull/851
* Support Qwen2-VL model for training by xrdaukar in https://github.com/oumi-ai/oumi/pull/850
* Fix our config parsing test by taenin in https://github.com/oumi-ai/oumi/pull/855
* Move `remote_params` from `GenerationParams` to `InferenceConfig` by xrdaukar in https://github.com/oumi-ai/oumi/pull/853
* Tune PyTorch CUDA allocation params for Llama 405B qlora by xrdaukar in https://github.com/oumi-ai/oumi/pull/854
* Update Llama 405B qlora config to use `adamw_torch_fused` optimizer by xrdaukar in https://github.com/oumi-ai/oumi/pull/856
* Update vllm config path in Polaris multi-node inference notebook by xrdaukar in https://github.com/oumi-ai/oumi/pull/857
* Change config dir structure for LLAVA and blip2 models by xrdaukar in https://github.com/oumi-ai/oumi/pull/859
* Enable LM-harness eval for multimodal, and add eval config for Llama 3.2 Vision 11B by xrdaukar in https://github.com/oumi-ai/oumi/pull/858
* Update inference engine docs. by taenin in https://github.com/oumi-ai/oumi/pull/862
* Initial port of zigzag attention by xrdaukar in https://github.com/oumi-ai/oumi/pull/861
* Add integration tests for visual SFT datasets (`VisionLanguageSftDataset`) by xrdaukar in https://github.com/oumi-ai/oumi/pull/863
* Update base config to_yaml signature by oelachqar in https://github.com/oumi-ai/oumi/pull/865
* Add support for batch inference by oelachqar in https://github.com/oumi-ai/oumi/pull/864
* Parallelize data preprocessing in `BaseMapDataset` by xrdaukar in https://github.com/oumi-ai/oumi/pull/868
* Tweak writer batch size computation in `BaseMapDataset` by xrdaukar in https://github.com/oumi-ai/oumi/pull/870
* Refresh fine-tuning notebook by oelachqar in https://github.com/oumi-ai/oumi/pull/872
* Support freezing nested model layers by xrdaukar in https://github.com/oumi-ai/oumi/pull/873
* Update dev_setup.md with minor clarification for use of SSH in GitHub by mkoukoumidis in https://github.com/oumi-ai/oumi/pull/866
* Rewrite CONTRIBUTING.md with new content, fixes and nits by mkoukoumidis in https://github.com/oumi-ai/oumi/pull/867
* Phi3 Vision (`microsoft/Phi-3-vision-128k-instruct`): Basic training support by xrdaukar in https://github.com/oumi-ai/oumi/pull/871
* Update main readme (part 1) by oelachqar in https://github.com/oumi-ai/oumi/pull/874
* Update readme sections (part 2) by oelachqar in https://github.com/oumi-ai/oumi/pull/875
* Refresh documentation pages by oelachqar in https://github.com/oumi-ai/oumi/pull/877
* `Qwen2-VL-2B-Instruct`: basic training config by xrdaukar in https://github.com/oumi-ai/oumi/pull/876
* Update recipes list in readme by oelachqar in https://github.com/oumi-ai/oumi/pull/878
* Update the list of supported VLM-s in README by xrdaukar in https://github.com/oumi-ai/oumi/pull/879
* Update documentation landing page by oelachqar in https://github.com/oumi-ai/oumi/pull/881
* Cap TRL library version to <0.12.2 by jgreer013 in https://github.com/oumi-ai/oumi/pull/882
* [tiny] fix remote inference bug for messages with image urls by oelachqar in https://github.com/oumi-ai/oumi/pull/883
* Adding support for Alpaca Eval by kaisopos in https://github.com/oumi-ai/oumi/pull/880
* Improve error messaging for the remote inference engine. by taenin in https://github.com/oumi-ai/oumi/pull/884
* Move multimodal configs from `configs/examples/vision` to `configs/recipes/vision` by xrdaukar in https://github.com/oumi-ai/oumi/pull/886
* Define internal VLM model configs by xrdaukar in https://github.com/oumi-ai/oumi/pull/885
* Alpaca Eval 2.0 notebook by kaisopos in https://github.com/oumi-ai/oumi/pull/887
* Freeze torchdata>=0.8.0,<0.10.0 by xrdaukar in https://github.com/oumi-ai/oumi/pull/890
* Create torchrun, accelerate CLI wrapper by xrdaukar in https://github.com/oumi-ai/oumi/pull/888
* Update VLM configs to use `oumi distributed torchrun` by xrdaukar in https://github.com/oumi-ai/oumi/pull/891
* Allow None batch size in `GenerationParams` by wizeng23 in https://github.com/oumi-ai/oumi/pull/895
* Evaluation: Saving metadata for future repro by kaisopos in https://github.com/oumi-ai/oumi/pull/892
* Fix missing import precommits when you create a fresh install of Oumi by taenin in https://github.com/oumi-ai/oumi/pull/898
* Update inference documentation by wizeng23 in https://github.com/oumi-ai/oumi/pull/893
* Allow simpler method of specifying LoRA Adapter for eval/inference by wizeng23 in https://github.com/oumi-ai/oumi/pull/889
* Introduce complex `Message.content` with sub-items - Phase 1 by xrdaukar in https://github.com/oumi-ai/oumi/pull/897
* Enable `Message.content` with sub-items - phase 2 by xrdaukar in https://github.com/oumi-ai/oumi/pull/901
* Update configs/notebooks to use typer CLI instead of `oumi.train/eval/infer` modules by wizeng23 in https://github.com/oumi-ai/oumi/pull/902
* Evaluation Notebooks: MT Bench by kaisopos in https://github.com/oumi-ai/oumi/pull/899
* Update chat template/conversation format - Phase 3 by xrdaukar in https://github.com/oumi-ai/oumi/pull/903
* Update `make skycode` to use Oumi launcher by wizeng23 in https://github.com/oumi-ai/oumi/pull/894
* Delete `main()` functions in train/evaluate/infer.py by wizeng23 in https://github.com/oumi-ai/oumi/pull/900
* Update Inference links in oumi README by xrdaukar in https://github.com/oumi-ai/oumi/pull/904
* Define `qwen2-vl-instruct` chat template by xrdaukar in https://github.com/oumi-ai/oumi/pull/907
* Set `max_pixels`/`min_pixels` params for Qwen2 VL model by xrdaukar in https://github.com/oumi-ai/oumi/pull/908
* Update chat template/conversation format - Phase 4 by xrdaukar in https://github.com/oumi-ai/oumi/pull/906
* Update configs to use `oumi distributed torchrun` by wizeng23 in https://github.com/oumi-ai/oumi/pull/905
* [tiny] minor docs updates by oelachqar in https://github.com/oumi-ai/oumi/pull/913
* [docs] clean-up docs pages by oelachqar in https://github.com/oumi-ai/oumi/pull/914
* Initial support for `HuggingFaceTB/SmolVLM-Instruct` by xrdaukar in https://github.com/oumi-ai/oumi/pull/910
* Catch exceptions for missing LoRA adapters by taenin in https://github.com/oumi-ai/oumi/pull/918
* Exclude custom models from Vision-Language model checks. by taenin in https://github.com/oumi-ai/oumi/pull/919
* Refactor `is_custom_model()` check into a function + Minor tests updates by xrdaukar in https://github.com/oumi-ai/oumi/pull/921
* Disable FSDP's `use_orig_params` for PEFT training by wizeng23 in https://github.com/oumi-ai/oumi/pull/869
* Mark `Message` as frozen by xrdaukar in https://github.com/oumi-ai/oumi/pull/923
* Rename `MessageContentItem` to `ContentItem` for brevity. by xrdaukar in https://github.com/oumi-ai/oumi/pull/925
* Delete `configs/examples/llama3_1` by wizeng23 in https://github.com/oumi-ai/oumi/pull/922
* Update OOM FAQ with instructions to tune CUDA alloc by xrdaukar in https://github.com/oumi-ai/oumi/pull/926
* Update `oumi launch` documentation by wizeng23 in https://github.com/oumi-ai/oumi/pull/930
* 90% speed up for the Oumi CLI by taenin in https://github.com/oumi-ai/oumi/pull/929
* Update `docs/datasets/vl_sft.md` and `docs/datasets/local_datasets.md` reflecting the latest changes in Conversation format and VL-dataset setup by xrdaukar in https://github.com/oumi-ai/oumi/pull/928
* Remove remaining references to `MessageContentItem` by xrdaukar in https://github.com/oumi-ai/oumi/pull/931
* Extend dev setup guide by wizeng23 in https://github.com/oumi-ai/oumi/pull/932
* Clean up configs by wizeng23 in https://github.com/oumi-ai/oumi/pull/933
* Add ascii art to all oumi commands in the CLI. by taenin in https://github.com/oumi-ai/oumi/pull/934
* [docs] Reorganize doc pages by oelachqar in https://github.com/oumi-ai/oumi/pull/935
* Fix broken pip targets by wizeng23 in https://github.com/oumi-ai/oumi/pull/927
* [docs] Enable doctest directive and add make doctests by oelachqar in https://github.com/oumi-ai/oumi/pull/937
* Evaluation refactoring by kaisopos in https://github.com/oumi-ai/oumi/pull/912
* Migrate Sky configs to Oumi job configs by wizeng23 in https://github.com/oumi-ai/oumi/pull/938
* [docs] fix misc doc build errors / warnings by oelachqar in https://github.com/oumi-ai/oumi/pull/936
* Update fineweb config by wizeng23 in https://github.com/oumi-ai/oumi/pull/940
* Evaluation refactoring (2nd) by kaisopos in https://github.com/oumi-ai/oumi/pull/939
* Update CLI parser to support `=` separated arguments by xrdaukar in https://github.com/oumi-ai/oumi/pull/942
* Integrating with AlpacaEval (PR 1 of 2) by kaisopos in https://github.com/oumi-ai/oumi/pull/941
* [Bug Fix] AlpacaEval by kaisopos in https://github.com/oumi-ai/oumi/pull/943
* Configure logging for `oumi.telemetry` in `train.py` by xrdaukar in https://github.com/oumi-ai/oumi/pull/944
* Only print ASCII logo on main process by xrdaukar in https://github.com/oumi-ai/oumi/pull/946
* [bug] Fix issue with registered dataset loading by oelachqar in https://github.com/oumi-ai/oumi/pull/947
* Alpaca Eval: integration test by kaisopos in https://github.com/oumi-ai/oumi/pull/945
* Print telemetry stats in more compact format by xrdaukar in https://github.com/oumi-ai/oumi/pull/950
* Refactor `conversation_utils` by xrdaukar in https://github.com/oumi-ai/oumi/pull/951
* Improve rank detection for logging by xrdaukar in https://github.com/oumi-ai/oumi/pull/953
* [docs] Add core concepts page by oelachqar in https://github.com/oumi-ai/oumi/pull/952
* [docs][tiny] Update acknowledgement page by oelachqar in https://github.com/oumi-ai/oumi/pull/954
* [docs][tiny] remove unused pages by oelachqar in https://github.com/oumi-ai/oumi/pull/955
* [docs][bug] Fix rendering issue in installation page by oelachqar in https://github.com/oumi-ai/oumi/pull/956
* [docs] Refresh evaluate user guide by oelachqar in https://github.com/oumi-ai/oumi/pull/961
* [docs] Refresh datasets resource section by oelachqar in https://github.com/oumi-ai/oumi/pull/960
* [docs] Refresh models resource section by oelachqar in https://github.com/oumi-ai/oumi/pull/959
* [docs] Refresh infer user guide by oelachqar in https://github.com/oumi-ai/oumi/pull/957
* [docs] Refresh training user guide by oelachqar in https://github.com/oumi-ai/oumi/pull/958
* [docs][tiny] Update links by oelachqar in https://github.com/oumi-ai/oumi/pull/962
* [docs] Update doc generation config by oelachqar in https://github.com/oumi-ai/oumi/pull/963
* [docs] Fix misc docstring rendering issues by oelachqar in https://github.com/oumi-ai/oumi/pull/964
* [docs] Refresh judge user guide by oelachqar in https://github.com/oumi-ai/oumi/pull/965
* [docs] Add documentation page for local training by oelachqar in https://github.com/oumi-ai/oumi/pull/966
* [docs] Update documentation index page by oelachqar in https://github.com/oumi-ai/oumi/pull/967
* [Evaluation] Refactor repro info & add repro info in AlpacaEval by kaisopos in https://github.com/oumi-ai/oumi/pull/949
* [docs] Add documentation page for notebook environment by oelachqar in https://github.com/oumi-ai/oumi/pull/969
* Add more options for saving LoRA models by wizeng23 in https://github.com/oumi-ai/oumi/pull/968
* [docs] Add vscode environment doc page by oelachqar in https://github.com/oumi-ai/oumi/pull/970
* [docs] Cleanup evaluation guide page by oelachqar in https://github.com/oumi-ai/oumi/pull/971
* [docs] Update recipes page by oelachqar in https://github.com/oumi-ai/oumi/pull/972
* [docs] Update tutorials page by oelachqar in https://github.com/oumi-ai/oumi/pull/973
* [docs] Add doc page for generative evals by oelachqar in https://github.com/oumi-ai/oumi/pull/974
* [docs] misc clarifications in train, judge, dataset pages by oelachqar in https://github.com/oumi-ai/oumi/pull/976
* [docs][tiny] Remove train/common_workflows page by oelachqar in https://github.com/oumi-ai/oumi/pull/977
* [docs] Add infer common workflows page by oelachqar in https://github.com/oumi-ai/oumi/pull/978
* Update notebooks setup documentation by wizeng23 in https://github.com/oumi-ai/oumi/pull/975
* [docs] Add supported models page by oelachqar in https://github.com/oumi-ai/oumi/pull/979
* [docs] update inference engine user guide by oelachqar in https://github.com/oumi-ai/oumi/pull/980
* Refactor conversation JSON builders by xrdaukar in https://github.com/oumi-ai/oumi/pull/981
* [docs] Update CLI reference page by oelachqar in https://github.com/oumi-ai/oumi/pull/983
* [docs] Minor updates to datasets intro page by oelachqar in https://github.com/oumi-ai/oumi/pull/984
* Remove internal GCS mentions by wizeng23 in https://github.com/oumi-ai/oumi/pull/982
* Update vision/smollm config names by wizeng23 in https://github.com/oumi-ai/oumi/pull/987
* Update `VLLMInferenceEngine` to support multimodal conversations by xrdaukar in https://github.com/oumi-ai/oumi/pull/986
* Image utils refactorings by xrdaukar in https://github.com/oumi-ai/oumi/pull/985
* [docs] Add training monitoring documentation page by oelachqar in https://github.com/oumi-ai/oumi/pull/988
* Add `torchvision` dependency by xrdaukar in https://github.com/oumi-ai/oumi/pull/989
* Define `configs/recipes/vision/phi3/sft/trl_gcp_job.yaml` for Phi3 Vision by xrdaukar in https://github.com/oumi-ai/oumi/pull/990
* `data.train.text_col` cleanup for VLM-s by xrdaukar in https://github.com/oumi-ai/oumi/pull/991
* [Evaluation] Updating Documentation (1st pass) by kaisopos in https://github.com/oumi-ai/oumi/pull/993
* Fix saving merged lora model by wizeng23 in https://github.com/oumi-ai/oumi/pull/992
* Add an option to return custom datasets as `IterableDataset` by xrdaukar in https://github.com/oumi-ai/oumi/pull/994
* Fix regression in CLI speed by taenin in https://github.com/oumi-ai/oumi/pull/997
* Create alias for json/jsonl datasets by oelachqar in https://github.com/oumi-ai/oumi/pull/998
* [Evaluation] Updating Documentation (Generative Benchmarks) by kaisopos in https://github.com/oumi-ai/oumi/pull/999
* [docs][tiny] Fix broken links, update sft dataset examples by oelachqar in https://github.com/oumi-ai/oumi/pull/1002

New Contributors
* ciaralema made their first contribution in https://github.com/oumi-ai/oumi/pull/715

**Full Changelog**: https://github.com/oumi-ai/oumi/compare/v0.1-alpha...v0.1.1-alpha

0.1alpha

What's Changed
* Add python project configs by oelachqar in https://github.com/oumi-ai/oumi/pull/1
* Add repo skeleton by oelachqar in https://github.com/oumi-ai/oumi/pull/2
* Export lema entrypoint scripts by oelachqar in https://github.com/oumi-ai/oumi/pull/3
* Update static type checking config by oelachqar in https://github.com/oumi-ai/oumi/pull/5
* Add example jupyter / colab notebook by oelachqar in https://github.com/oumi-ai/oumi/pull/4
* Refactor config parsing to use omegaconf by oelachqar in https://github.com/oumi-ai/oumi/pull/6
* Updating documentation (Dev Environment Setup) by kaisopos in https://github.com/oumi-ai/oumi/pull/7
* Add tests and vscode config by oelachqar in https://github.com/oumi-ai/oumi/pull/8
* Added DPOTrainer example to repo, as well as cuda device cleanup to training loop by jgreer013 in https://github.com/oumi-ai/oumi/pull/9
* Adding torch as top-level module dependency by optas in https://github.com/oumi-ai/oumi/pull/10
* Add configs for specific hardware requirements by jgreer013 in https://github.com/oumi-ai/oumi/pull/11
* Sort pre-commit hooks lexicographically by xrdaukar in https://github.com/oumi-ai/oumi/pull/12
* Add logging config by oelachqar in https://github.com/oumi-ai/oumi/pull/13
* Lema inference by xrdaukar in https://github.com/oumi-ai/oumi/pull/14
* Panos dev by optas in https://github.com/oumi-ai/oumi/pull/16
* Add job launcher by oelachqar in https://github.com/oumi-ai/oumi/pull/15
* Making split of data a flexible variable by optas in https://github.com/oumi-ai/oumi/pull/17
* Configure max file size in precommit hooks by xrdaukar in https://github.com/oumi-ai/oumi/pull/18
* Minor bugfix and documentation update by oelachqar in https://github.com/oumi-ai/oumi/pull/19
* adding pynvml to train env by kaisopos in https://github.com/oumi-ai/oumi/pull/20
* Panos dev by optas in https://github.com/oumi-ai/oumi/pull/22
* Augmenting Types for training hyperparams by optas in https://github.com/oumi-ai/oumi/pull/23
* Train refactoring (config file visibility) + a few minor changes by kaisopos in https://github.com/oumi-ai/oumi/pull/21
* Minimal test for train function by xrdaukar in https://github.com/oumi-ai/oumi/pull/25
* Fix leftover '_torch_dtype' in 'ModelParams' by xrdaukar in https://github.com/oumi-ai/oumi/pull/26
* Update GPU types list in the default SkyPilot config by xrdaukar in https://github.com/oumi-ai/oumi/pull/27
* Add a missing lema-infer command under [project.scripts] by xrdaukar in https://github.com/oumi-ai/oumi/pull/28
* add basic pytests for evaluate and infer by xrdaukar in https://github.com/oumi-ai/oumi/pull/29
* Update README and pyproject.toml by wizeng23 in https://github.com/oumi-ai/oumi/pull/30
* A helper function to print info about available CUDA devices by xrdaukar in https://github.com/oumi-ai/oumi/pull/31
* Update SkyPilot config to start using torchrun by xrdaukar in https://github.com/oumi-ai/oumi/pull/32
* Support basic single-node, multi-gpu training by xrdaukar in https://github.com/oumi-ai/oumi/pull/33
* Run all precommit hooks on the repo by xrdaukar in https://github.com/oumi-ai/oumi/pull/35
* Add experimental code for llama cpp inference by jgreer013 in https://github.com/oumi-ai/oumi/pull/37
* Create skeleton of STYLE_GUIDE.md by xrdaukar in https://github.com/oumi-ai/oumi/pull/36
* Adding support for training custom models (for now just a dummy model). by kaisopos in https://github.com/oumi-ai/oumi/pull/38
* Fix custom model name in test_train.py by xrdaukar in https://github.com/oumi-ai/oumi/pull/39
* Configure pyright (static type checker) and resolve existing type errors to make it pass by xrdaukar in https://github.com/oumi-ai/oumi/pull/41
* fix trailing whitespace warning in STYLE_GUIDE.md by xrdaukar in https://github.com/oumi-ai/oumi/pull/43
* Configure initial GitHub Actions workflow to run pre-commits and tests by xrdaukar in https://github.com/oumi-ai/oumi/pull/44
* A variety of proposed extensions to finetune a chat-based model (starting with Zephyr) by optas in https://github.com/oumi-ai/oumi/pull/34
* Fix syntax error in ultrachat by xrdaukar in https://github.com/oumi-ai/oumi/pull/48
* Create initial version of CONTRIBUTING.md by xrdaukar in https://github.com/oumi-ai/oumi/pull/46
* Reduce the number of training steps from 5 to 3 to make test_train.py faster by xrdaukar in https://github.com/oumi-ai/oumi/pull/49
* Adding registry for custom models. by kaisopos in https://github.com/oumi-ai/oumi/pull/42
* Add config and streaming args to DataParams by wizeng23 in https://github.com/oumi-ai/oumi/pull/47
* Update Pre-review Tests to only run on pull_request by xrdaukar in https://github.com/oumi-ai/oumi/pull/50
* Add training flags to compute token-based stats by xrdaukar in https://github.com/oumi-ai/oumi/pull/51
* reduce test training steps in another test which I missed before by xrdaukar in https://github.com/oumi-ai/oumi/pull/53
* Rename var names of *Params classes by wizeng23 in https://github.com/oumi-ai/oumi/pull/52
* Make some NVIDIA-specific dependencies optional by xrdaukar in https://github.com/oumi-ai/oumi/pull/54
* fix trl version as 0.8.6 by xrdaukar in https://github.com/oumi-ai/oumi/pull/56
* Remove reference to torch.cuda.clock_rate by xrdaukar in https://github.com/oumi-ai/oumi/pull/57
* Update inference to support non-interactive batch mode. by kaisopos in https://github.com/oumi-ai/oumi/pull/58
* Update README.md to include Linux/WSL specific instructions by xrdaukar in https://github.com/oumi-ai/oumi/pull/59
* Minor formatting improvements in README.md by xrdaukar in https://github.com/oumi-ai/oumi/pull/60
* Minor: Updating Lora Params by optas in https://github.com/oumi-ai/oumi/pull/55
* Support dataset packing by wizeng23 in https://github.com/oumi-ai/oumi/pull/63
* Disallow relative imports in LeMa by xrdaukar in https://github.com/oumi-ai/oumi/pull/65
* Add text_col param that's required for SFTTrainer by wizeng23 in https://github.com/oumi-ai/oumi/pull/66
* Refactor common config parsing logic (YAML, arg_list) into a common util by xrdaukar in https://github.com/oumi-ai/oumi/pull/68
* Standardize test naming convention by wizeng23 in https://github.com/oumi-ai/oumi/pull/69
* Adding support for a hardcoded evaluation with MMLU. by kaisopos in https://github.com/oumi-ai/oumi/pull/67
* Minor changes to the default configs/skypilot/sky.yaml config by xrdaukar in https://github.com/oumi-ai/oumi/pull/71
* Prototype to pass `config.model.model_max_length` to Trainers by xrdaukar in https://github.com/oumi-ai/oumi/pull/70
* [Inference] Remove the prepended prompts from model responses. by kaisopos in https://github.com/oumi-ai/oumi/pull/73
* Add a util to print versioning info by xrdaukar in https://github.com/oumi-ai/oumi/pull/74
* Switch to tempfile.TemporaryDirectory() in test_train.py by xrdaukar in https://github.com/oumi-ai/oumi/pull/75
* Update docstring verbs to descriptive form by wizeng23 in https://github.com/oumi-ai/oumi/pull/76
* Add sample accelerate and fsdp configs by xrdaukar in https://github.com/oumi-ai/oumi/pull/77
* Refactor code to get device rank and world size into a helper function by xrdaukar in https://github.com/oumi-ai/oumi/pull/79
* Add a simple util to print a model summary (e.g., layer names, architecture summary) by xrdaukar in https://github.com/oumi-ai/oumi/pull/80
* Freeze numpy to pre 2.0 version by xrdaukar in https://github.com/oumi-ai/oumi/pull/81
* Adding inference support for next logit probability. by kaisopos in https://github.com/oumi-ai/oumi/pull/78
* Create FSDP configs for Phi3 by xrdaukar in https://github.com/oumi-ai/oumi/pull/82
* Auto-format pyproject.toml with "Even Better TOML" by xrdaukar in https://github.com/oumi-ai/oumi/pull/83
* Minor cleanup updates to SkyPilot configs by xrdaukar in https://github.com/oumi-ai/oumi/pull/84
* Mixed Precision Training, Flash-Attention-2, Print-trainable-params by optas in https://github.com/oumi-ai/oumi/pull/85
* Update README.md to include basic instructions for multi-GPU training (DDP, FSDP) by xrdaukar in https://github.com/oumi-ai/oumi/pull/86
* Start using $SKYPILOT_NUM_GPUS_PER_NODE in SkyPilot config by xrdaukar in https://github.com/oumi-ai/oumi/pull/90
* Add configs for FineWeb Llama2 pretraining by wizeng23 in https://github.com/oumi-ai/oumi/pull/89
* Quantization by optas in https://github.com/oumi-ai/oumi/pull/87
* Update the default SkyPilot config to print more debug/context info by xrdaukar in https://github.com/oumi-ai/oumi/pull/92
* Add license by oelachqar in https://github.com/oumi-ai/oumi/pull/93
* Initial version of SkyPilot config for multi-node training (num_nodes: N) by xrdaukar in https://github.com/oumi-ai/oumi/pull/94
* MMLU eval refactor. by kaisopos in https://github.com/oumi-ai/oumi/pull/88
* Remove comparison between LOCAL_RANK and RANK by xrdaukar in https://github.com/oumi-ai/oumi/pull/96
* Handling the loading of peft adapters and other minor issues (e.g., adding more logging parameters) by optas in https://github.com/oumi-ai/oumi/pull/91
* Update configs/skypilot/sky_llama2b.yaml to start using sky_init.sh by xrdaukar in https://github.com/oumi-ai/oumi/pull/97
* Add bool param to resume training from the last known checkpoint (if exists) by xrdaukar in https://github.com/oumi-ai/oumi/pull/99
* Inference: save/restore probabilities to/from file. by kaisopos in https://github.com/oumi-ai/oumi/pull/98
* Add support for dataset mixtures during training by taenin in https://github.com/oumi-ai/oumi/pull/95
* Add train, test, and validation splits to the LeMa config. by taenin in https://github.com/oumi-ai/oumi/pull/101
* nanoGPT (GPT2) pretraining recipe by wizeng23 in https://github.com/oumi-ai/oumi/pull/103
* Minor: Updates on Zephyr-Config by optas in https://github.com/oumi-ai/oumi/pull/106
* Update pre-commit config by oelachqar in https://github.com/oumi-ai/oumi/pull/108
* Add integration tests that verify all configs load properly. by taenin in https://github.com/oumi-ai/oumi/pull/102
* Handling Gradient Checkpointing by optas in https://github.com/oumi-ai/oumi/pull/107
* Update skypilot/sky_gpt2.yaml to include an example of how to mount a GCS dir by xrdaukar in https://github.com/oumi-ai/oumi/pull/111
* Rename dataset_params.dataset_config to dataset_params.subset by oelachqar in https://github.com/oumi-ai/oumi/pull/109
* Refactor SFT dataset preprocessing by oelachqar in https://github.com/oumi-ai/oumi/pull/112
* Support shuffling and random seeds for dataset sampling by taenin in https://github.com/oumi-ai/oumi/pull/113
* Split types file into module by oelachqar in https://github.com/oumi-ai/oumi/pull/114
* Add GCP deps to `lema[cloud]` by xrdaukar in https://github.com/oumi-ai/oumi/pull/117
* Add llama3-instruct jinja template by jgreer013 in https://github.com/oumi-ai/oumi/pull/118
* Update sky_init.sh to print current dir by xrdaukar in https://github.com/oumi-ai/oumi/pull/120
* Add prompt response sft preprocessor factory for aya dataset by jgreer013 in https://github.com/oumi-ai/oumi/pull/121
* Add configs for chatqa model by oelachqar in https://github.com/oumi-ai/oumi/pull/110
* Saving inference probs in `parquet` format. by kaisopos in https://github.com/oumi-ai/oumi/pull/115
* Refactor model registry by oelachqar in https://github.com/oumi-ai/oumi/pull/122
* Define BaseTrainer abstraction by xrdaukar in https://github.com/oumi-ai/oumi/pull/116
* Add a registry for metric functions that we can run during training. by taenin in https://github.com/oumi-ai/oumi/pull/126
* Update training_params.py so HF trainer uses num_train_epochs by optas in https://github.com/oumi-ai/oumi/pull/125
* Add native PyTorch model training by oelachqar in https://github.com/oumi-ai/oumi/pull/123
* [Quick fix] Handle pynvml being misconfigured by taenin in https://github.com/oumi-ai/oumi/pull/128
* Enable DP for inference by kaisopos in https://github.com/oumi-ai/oumi/pull/100
* Add configs for training llama3-8b with aya finetune by jgreer013 in https://github.com/oumi-ai/oumi/pull/130
* Update HF save_model() to only save on master replica by xrdaukar in https://github.com/oumi-ai/oumi/pull/131
* Pipe MetricsFunction from our config to train.py by taenin in https://github.com/oumi-ai/oumi/pull/129
* Fixing broken eval. by kaisopos in https://github.com/oumi-ai/oumi/pull/132
* Minor updates in SkyPilot docstrings by xrdaukar in https://github.com/oumi-ai/oumi/pull/133
* Fix bug with DP evaluation by oelachqar in https://github.com/oumi-ai/oumi/pull/134
* [MMLU custom eval] removing hardcoded subject, samples, num-shots. by kaisopos in https://github.com/oumi-ai/oumi/pull/135
* Add an initial config for async evaluations by taenin in https://github.com/oumi-ai/oumi/pull/137
* Add a new top level command: evaluate_async by taenin in https://github.com/oumi-ai/oumi/pull/138
* Minor bug fix in writing evaluations by taenin in https://github.com/oumi-ai/oumi/pull/140
* Support full GPT2 run by wizeng23 in https://github.com/oumi-ai/oumi/pull/141
* Upload sample configs for running async evals on GPT2 by taenin in https://github.com/oumi-ai/oumi/pull/139
* Apply `torch.distributed.barrier()` in save_model by xrdaukar in https://github.com/oumi-ai/oumi/pull/136
* Create an experimental util to generate pre-tokenized datasets (Parquet files) with `token_ids` column by xrdaukar in https://github.com/oumi-ai/oumi/pull/144
* Created a new dataset class with async loading & tokenization by jgreer013 in https://github.com/oumi-ai/oumi/pull/142
* Remove private debug dir from configs/skypilot/sky_gpt2.yaml by xrdaukar in https://github.com/oumi-ai/oumi/pull/145
* Define dataloader_num_workers and dataloader_prefetch_factor params by xrdaukar in https://github.com/oumi-ai/oumi/pull/146
* [Evaluations] Integration with LM Evaluation Harness by kaisopos in https://github.com/oumi-ai/oumi/pull/143
* Support model compilation by wizeng23 in https://github.com/oumi-ai/oumi/pull/147
* Multiple cleanup changes in configs/skypilot/sky_gpt2.yaml by xrdaukar in https://github.com/oumi-ai/oumi/pull/148
* Update SkyPilot training configs to include `run_name` by xrdaukar in https://github.com/oumi-ai/oumi/pull/149
* Update async eval to properly parse eval configs by taenin in https://github.com/oumi-ai/oumi/pull/150
* Zephyr Configs [full-model, skypilot] by optas in https://github.com/oumi-ai/oumi/pull/152
* Disable model.compile in gpt2 config by xrdaukar in https://github.com/oumi-ai/oumi/pull/154
* Update sky_init.sh to print task id and cluster info by xrdaukar in https://github.com/oumi-ai/oumi/pull/156
* [bug] Include jinja templates in build by oelachqar in https://github.com/oumi-ai/oumi/pull/158
* Add basic scaffolding for torch profiler around training loop by xrdaukar in https://github.com/oumi-ai/oumi/pull/157
* [Minor] Adding `attn_implementation` arg in LM Harness. by kaisopos in https://github.com/oumi-ai/oumi/pull/160
* Update Trainer.save_model to start using the public HF save_model() method (except for PEFT) by xrdaukar in https://github.com/oumi-ai/oumi/pull/161
* Update the vanilla eval config for gpt2 to run hellaswag evals. by taenin in https://github.com/oumi-ai/oumi/pull/165
* Add Dataset base class & API by oelachqar in https://github.com/oumi-ai/oumi/pull/151
* Add experimental notebook to run Nvidia's ChatRAG-Bench evaluation by oelachqar in https://github.com/oumi-ai/oumi/pull/166
* Update ChatQA training configs by oelachqar in https://github.com/oumi-ai/oumi/pull/159
* Update async dataset class to support pre-tokenized datasets by oelachqar in https://github.com/oumi-ai/oumi/pull/162
* Create a launcher script for Polaris jobs (ALCF) by taenin in https://github.com/oumi-ai/oumi/pull/164
* Update pre-tokenized column name to be `input_ids` in `tokenize_dataset` tool by xrdaukar in https://github.com/oumi-ai/oumi/pull/167
* Replacing `EvaluationConfig`'s `DataParams` with `DatasetSplitParams` by kaisopos in https://github.com/oumi-ai/oumi/pull/168
* Submit config to create Custom IAM role for SkyPilot Service Accounts on GCP by xrdaukar in https://github.com/oumi-ai/oumi/pull/169
* Remove GCP project reference by xrdaukar in https://github.com/oumi-ai/oumi/pull/172
* Make sure output training dir exists by xrdaukar in https://github.com/oumi-ai/oumi/pull/171
* Improve launcher usability via command line arguments. by taenin in https://github.com/oumi-ai/oumi/pull/170
* Add a source directory to the Polaris launcher and clean up rsync copies. by taenin in https://github.com/oumi-ai/oumi/pull/173
* Introduce LEMA_RUN_NAME env var to SkyPilot training configs by xrdaukar in https://github.com/oumi-ai/oumi/pull/174
* Minor changes: (1) remove hardcoded HF_TOKEN; (2) log effective training config by xrdaukar in https://github.com/oumi-ai/oumi/pull/175
* Tweak default params in gpt2 scripts by xrdaukar in https://github.com/oumi-ai/oumi/pull/177
* LM Harness optimizations by kaisopos in https://github.com/oumi-ai/oumi/pull/176
* No longer ignore `.git` in Polaris (needed for venv) by taenin in https://github.com/oumi-ai/oumi/pull/179
* A hack for running jobs on Polaris. by taenin in https://github.com/oumi-ai/oumi/pull/180
* [Polaris] Move venv creation from worker to launcher. by taenin in https://github.com/oumi-ai/oumi/pull/181
* Update README.md to include `sky launch - 10 ...` example by xrdaukar in https://github.com/oumi-ai/oumi/pull/182
* [Evaluations] Adding support for HuggingFace's leaderboard v1 benchmarks by kaisopos in https://github.com/oumi-ai/oumi/pull/183
* Llama 3 Aya Fine-Tuning Updates by jgreer013 in https://github.com/oumi-ai/oumi/pull/163
* Remove logger propagation by wizeng23 in https://github.com/oumi-ai/oumi/pull/185
* [Evaluations] HF leaderboard v1 configs by kaisopos in https://github.com/oumi-ai/oumi/pull/186
* Move logging.py to utils by wizeng23 in https://github.com/oumi-ai/oumi/pull/187
* Create the Jobs config for the lema launcher. by taenin in https://github.com/oumi-ai/oumi/pull/188
* Initial abstract base classes for the lema launcher. by taenin in https://github.com/oumi-ai/oumi/pull/189
* Added mfu calculation and tests by jgreer013 in https://github.com/oumi-ai/oumi/pull/190
* Introduce two new training params: save_model and save_epoch by xrdaukar in https://github.com/oumi-ai/oumi/pull/191
* Update FineWeb ablation model configs by xrdaukar in https://github.com/oumi-ai/oumi/pull/196
* Added MFU telemetry by jgreer013 in https://github.com/oumi-ai/oumi/pull/193
* Update Polaris script by wizeng23 in https://github.com/oumi-ai/oumi/pull/192
* Rename `training.save_model` param to `training.save_final_model` for clarity by xrdaukar in https://github.com/oumi-ai/oumi/pull/197
* Support disabling dropout by wizeng23 in https://github.com/oumi-ai/oumi/pull/184
* Update actual mfu calculation by jgreer013 in https://github.com/oumi-ai/oumi/pull/199
* Implement a client for talking to SkyPilot. by taenin in https://github.com/oumi-ai/oumi/pull/201
* Fixed miscalculation of second step start time by jgreer013 in https://github.com/oumi-ai/oumi/pull/202
* Update `ablation-model-fineweb-v1` config to start using grad checkpointing by xrdaukar in https://github.com/oumi-ai/oumi/pull/198
* Add distributed operations by oelachqar in https://github.com/oumi-ai/oumi/pull/194
* Add pre-commit hooks for credential scanning + new checks by oelachqar in https://github.com/oumi-ai/oumi/pull/195
* Sample job for multi-node training by xrdaukar in https://github.com/oumi-ai/oumi/pull/203
* Update Polaris multi-node launcher by xrdaukar in https://github.com/oumi-ai/oumi/pull/204
* Multi-node config improvements for llama2b model (`HuggingFaceFW/ablation-model-fineweb-v1`) by xrdaukar in https://github.com/oumi-ai/oumi/pull/205
* Minor updates to Polaris launcher script by xrdaukar in https://github.com/oumi-ai/oumi/pull/206
* Update Lema FSDP configs by xrdaukar in https://github.com/oumi-ai/oumi/pull/207
* [tiny] add default formatter for markdown by oelachqar in https://github.com/oumi-ai/oumi/pull/210
* Preparations for Lema custom pre-training loop by oelachqar in https://github.com/oumi-ai/oumi/pull/208
* Update MFU callback to support Lema trainer by oelachqar in https://github.com/oumi-ai/oumi/pull/209
* Configure llama2b model to use FSDP HYBRID_SHARD by xrdaukar in https://github.com/oumi-ai/oumi/pull/213
* Implement a Cluster resource manager around Sky Pilot. by taenin in https://github.com/oumi-ai/oumi/pull/214
* Add utils to setup distributed training by oelachqar in https://github.com/oumi-ai/oumi/pull/211
* Add example notebook to train NanoGPT model with Lema by oelachqar in https://github.com/oumi-ai/oumi/pull/212
* [tiny] update sky pilot ssh config by oelachqar in https://github.com/oumi-ai/oumi/pull/215
* Implement a Cloud resource manager around Sky Pilot by taenin in https://github.com/oumi-ai/oumi/pull/216
* Sanitize run name by xrdaukar in https://github.com/oumi-ai/oumi/pull/217
* Use "cluster_name" instead of "name" in the Sky client. by taenin in https://github.com/oumi-ai/oumi/pull/218
* Minor logging improvements in Polaris sample job scripts by xrdaukar in https://github.com/oumi-ai/oumi/pull/219
* Update shell scripts to point to local dataset by jgreer013 in https://github.com/oumi-ai/oumi/pull/221
* Support FSDP on Polaris using accelerate by xrdaukar in https://github.com/oumi-ai/oumi/pull/220
* Add telemetry manager by oelachqar in https://github.com/oumi-ai/oumi/pull/222
* Switch to the latest transformers=4.43.1 by xrdaukar in https://github.com/oumi-ai/oumi/pull/223
* Re-enable model compilation for llama2b model by xrdaukar in https://github.com/oumi-ai/oumi/pull/224
* Increase llama2b batch size from 2 to 3 by xrdaukar in https://github.com/oumi-ai/oumi/pull/225
* Add makefile with common local commands by oelachqar in https://github.com/oumi-ai/oumi/pull/227
* Add DeepSpeed config for Llama2b by wizeng23 in https://github.com/oumi-ai/oumi/pull/228
* MFU Improvements for Llama 2B on Polaris by jgreer013 in https://github.com/oumi-ai/oumi/pull/229
* FSDP config updates by xrdaukar in https://github.com/oumi-ai/oumi/pull/231
* Rename accelerate configs to be in line with other configs by wizeng23 in https://github.com/oumi-ai/oumi/pull/232
* [tiny] Update logger format to include rank, pid and threadname by oelachqar in https://github.com/oumi-ai/oumi/pull/235
* Set model.config.use_cache = False by xrdaukar in https://github.com/oumi-ai/oumi/pull/233
* Experimental training loop for pre-training by oelachqar in https://github.com/oumi-ai/oumi/pull/230
* Disable gradient checkpointing in SkyPilot llama2b config by xrdaukar in https://github.com/oumi-ai/oumi/pull/236
* Implement a client for communicating with Polaris via python. by taenin in https://github.com/oumi-ai/oumi/pull/234
* Add SkyPilot config for `experimental/pretokenize/tokenize_dataset.py` by xrdaukar in https://github.com/oumi-ai/oumi/pull/237
* Update Fabric.run() calls to use the "warn" flag. by taenin in https://github.com/oumi-ai/oumi/pull/239
* Update `pretokenize` tool to support input datasets by xrdaukar in https://github.com/oumi-ai/oumi/pull/238
* Add optimizers builder function by oelachqar in https://github.com/oumi-ai/oumi/pull/240
* Add a "put" method in the Polaris client for writing remote files. by taenin in https://github.com/oumi-ai/oumi/pull/242
* Add deepspeed (DS) config to support hierarchical partitioning by wizeng23 in https://github.com/oumi-ai/oumi/pull/244
* Add support for uploading MFU in wandb by jgreer013 in https://github.com/oumi-ai/oumi/pull/245
* Create a Polaris Cluster class consuming the polaris client by taenin in https://github.com/oumi-ai/oumi/pull/246
* Add initial docker image by oelachqar in https://github.com/oumi-ai/oumi/pull/241
* Fix a string in the Polaris Cluster tests. by taenin in https://github.com/oumi-ai/oumi/pull/249
* Set training loop random seeds by oelachqar in https://github.com/oumi-ai/oumi/pull/248
* Fix bug with Polaris multi-node script by wizeng23 in https://github.com/oumi-ai/oumi/pull/247
* Add torchfix linting target by oelachqar in https://github.com/oumi-ai/oumi/pull/250
* Add training state classes by oelachqar in https://github.com/oumi-ai/oumi/pull/251
* Save and restore telemetry state during training by oelachqar in https://github.com/oumi-ai/oumi/pull/252
* Configure file logging by oelachqar in https://github.com/oumi-ai/oumi/pull/254
* Create a Polaris Cloud class consuming the polaris client by taenin in https://github.com/oumi-ai/oumi/pull/253
* Define a registry for cloud builders. by taenin in https://github.com/oumi-ai/oumi/pull/255
* Add logging to tensor board, wandb in custom training loop by oelachqar in https://github.com/oumi-ai/oumi/pull/256
* Add a `get_all` utility method to the LeMa Registry by taenin in https://github.com/oumi-ai/oumi/pull/257
* Update the BaseCloud `up_cluster` definition to return a job status. by taenin in https://github.com/oumi-ai/oumi/pull/258
* Create a launcher class for the LeMa Launcher. by taenin in https://github.com/oumi-ai/oumi/pull/261
* Add script to benchmark datasets and data loader params by oelachqar in https://github.com/oumi-ai/oumi/pull/260
* [Follow-up] data loader benchmarking script by oelachqar in https://github.com/oumi-ai/oumi/pull/262
* Create DDP configs for `accelerate` by xrdaukar in https://github.com/oumi-ai/oumi/pull/259
* Switch from nightly to stable version of SkyPilot by xrdaukar in https://github.com/oumi-ai/oumi/pull/264
* Make all tests green by xrdaukar in https://github.com/oumi-ai/oumi/pull/265
* Set `dataloader_pin_memory=True` to be intentional by xrdaukar in https://github.com/oumi-ai/oumi/pull/266
* Move `torch_profiler_utils` from `lema.utils` to `lema.performance` by xrdaukar in https://github.com/oumi-ai/oumi/pull/267
* Add BaseIterableDataset, refactor DataLoader to use DataPipes by oelachqar in https://github.com/oumi-ai/oumi/pull/263
* Add a `dataset_kwargs` attribute, tests by oelachqar in https://github.com/oumi-ai/oumi/pull/268
* Use stateful dataloader by oelachqar in https://github.com/oumi-ai/oumi/pull/269
* Update the polaris client / cluster to work e2e by taenin in https://github.com/oumi-ai/oumi/pull/270
* Update package structure for the launcher by taenin in https://github.com/oumi-ai/oumi/pull/273
* [tiny] Register debug datasets by oelachqar in https://github.com/oumi-ai/oumi/pull/272
* Update several of our launcher base fields to use strings instead of ints. by taenin in https://github.com/oumi-ai/oumi/pull/274
* Configure data loader sampling strategy for map-style datasets by oelachqar in https://github.com/oumi-ai/oumi/pull/271
* Ensure we CD into the working DIR before submitting polaris jobs. by taenin in https://github.com/oumi-ai/oumi/pull/276
* Compute the number of dataloader workers per node by xrdaukar in https://github.com/oumi-ai/oumi/pull/277
* Introduce BaseTokenizer alias by xrdaukar in https://github.com/oumi-ai/oumi/pull/280
* Cache get_device_rank_info by xrdaukar in https://github.com/oumi-ai/oumi/pull/279
* Adding initial scripts for running polaris jobs. by taenin in https://github.com/oumi-ai/oumi/pull/275
* Update the polaris client to automatically set execute permissions for copied files. by taenin in https://github.com/oumi-ai/oumi/pull/286
* Deprecate building models data parallel by oelachqar in https://github.com/oumi-ai/oumi/pull/282
* Switch to using safetensors when saving models by oelachqar in https://github.com/oumi-ai/oumi/pull/281
* Add ability to validate configs and params after init by oelachqar in https://github.com/oumi-ai/oumi/pull/285
* Some updates to Polaris launcher script by xrdaukar in https://github.com/oumi-ai/oumi/pull/287
* Upgrade to latest TRL version, remove numpy version condition by oelachqar in https://github.com/oumi-ai/oumi/pull/283
* Add learning rate builder function by oelachqar in https://github.com/oumi-ai/oumi/pull/284
* Remove patchwork as a dep. by taenin in https://github.com/oumi-ai/oumi/pull/290
* Set up initial demo launcher jobs for GCP. by taenin in https://github.com/oumi-ai/oumi/pull/288
* [tiny] cleanup pyproject.toml dependencies by oelachqar in https://github.com/oumi-ai/oumi/pull/292
* Make dataset data backend attribute read-only by oelachqar in https://github.com/oumi-ai/oumi/pull/291
* Optimize Github actions by oelachqar in https://github.com/oumi-ai/oumi/pull/289
* Misc minor changes by xrdaukar in https://github.com/oumi-ai/oumi/pull/293
* [tiny] Update GitHub action cache version by oelachqar in https://github.com/oumi-ai/oumi/pull/295
* Rename 'NodeParams' -> 'JobResources' by taenin in https://github.com/oumi-ai/oumi/pull/296
* Disable compilation for DDP `accelerate launch` config by xrdaukar in https://github.com/oumi-ai/oumi/pull/297
* Export top level launcher functions and instantiate a default launcher. by taenin in https://github.com/oumi-ai/oumi/pull/298
* Prevent HF version bump by taenin in https://github.com/oumi-ai/oumi/pull/300
* Add dtype/mixed precision configs to Lema trainer by wizeng23 in https://github.com/oumi-ai/oumi/pull/278
* Create a notebook tutorial for running remote training. by taenin in https://github.com/oumi-ai/oumi/pull/299
* Increase the default value of `ProfilerParams.row_limit` from 20 to 50 by xrdaukar in https://github.com/oumi-ai/oumi/pull/304
* Mini guide on using basic lema functionality by oelachqar in https://github.com/oumi-ai/oumi/pull/303
* Compute MFU based on HF `total_flos` (alternative way to compute MFU) by xrdaukar in https://github.com/oumi-ai/oumi/pull/301
* Support GPT2 training with Lema trainer by wizeng23 in https://github.com/oumi-ai/oumi/pull/302
* Add a client for running local jobs via the launcher. by taenin in https://github.com/oumi-ai/oumi/pull/305
* Add a local cluster for running local jobs. by taenin in https://github.com/oumi-ai/oumi/pull/306
* Support llama2b with lema trainer by wizeng23 in https://github.com/oumi-ai/oumi/pull/308
* Add a convenience method for listing all registered clouds. by taenin in https://github.com/oumi-ai/oumi/pull/310
* [ALCF] Reverse Polaris GPU order to match CPU/GPU affinities by xrdaukar in https://github.com/oumi-ai/oumi/pull/307
* Create a local cloud for the LeMa launcher. by taenin in https://github.com/oumi-ai/oumi/pull/309
* Remove some leftover occurrences of `builtin_` prefix in HF MFU callback by xrdaukar in https://github.com/oumi-ai/oumi/pull/312
* Clean up mixed precision params by wizeng23 in https://github.com/oumi-ai/oumi/pull/311
* Add finetuning tutorial by oelachqar in https://github.com/oumi-ai/oumi/pull/313
* Fix interpolation when loading lema configs. by taenin in https://github.com/oumi-ai/oumi/pull/314
* [bugfix] GPU workers not waiting for global leader to save final checkpoint by oelachqar in https://github.com/oumi-ai/oumi/pull/315
* Add simple benchmark script for distributed operations by oelachqar in https://github.com/oumi-ai/oumi/pull/316
* Add a 'done' field to the LeMa job status object. by taenin in https://github.com/oumi-ai/oumi/pull/317
* Fix a small typo in Lema README by xrdaukar in https://github.com/oumi-ai/oumi/pull/318
* Add pytorch profiler (`-p`) option to `multinode_example_worker.sh` script by xrdaukar in https://github.com/oumi-ai/oumi/pull/319
* Create a simpler tutorial for running jobs. by taenin in https://github.com/oumi-ai/oumi/pull/320
* Minor cleanups in Lema training loop by xrdaukar in https://github.com/oumi-ai/oumi/pull/322
* Remove unbalanced call to `barrier()` in `HuggingFaceTrainer.save_model` by xrdaukar in https://github.com/oumi-ai/oumi/pull/323
* Create a tutorial for custom clouds. by taenin in https://github.com/oumi-ai/oumi/pull/321
* Add support for logging stdout and stderr for Local runs. by taenin in https://github.com/oumi-ai/oumi/pull/324
* Fix nanoGPT notebook by wizeng23 in https://github.com/oumi-ai/oumi/pull/325
* Add more pytorch profiler instrumentations in Lema training loop by xrdaukar in https://github.com/oumi-ai/oumi/pull/327
* Add training param: `dataloader_main_process_only` by xrdaukar in https://github.com/oumi-ai/oumi/pull/326
* Fix synchronization issues in LEMA training loop by xrdaukar in https://github.com/oumi-ai/oumi/pull/328
* Update LEMA training loop to count tokens on CPU by xrdaukar in https://github.com/oumi-ai/oumi/pull/330
* Update README.md by taenin in https://github.com/oumi-ai/oumi/pull/331
* Add various improvements to Lema trainer by wizeng23 in https://github.com/oumi-ai/oumi/pull/329
* Add PyTorch profiler annotation for each step/micro-step by xrdaukar in https://github.com/oumi-ai/oumi/pull/333
* Enable `HfMfuTrainerCallback` if supported by xrdaukar in https://github.com/oumi-ai/oumi/pull/332
* Add support for PyTorch profiling schedule by xrdaukar in https://github.com/oumi-ai/oumi/pull/334
* Set up Sphinx-based doc generation for LeMa by taenin in https://github.com/oumi-ai/oumi/pull/335
* Fix dataclass strings to be parsable by our docs generator. by taenin in https://github.com/oumi-ai/oumi/pull/337
* Update ProfilerStepCallback to add `microstep` profiler annotations by xrdaukar in https://github.com/oumi-ai/oumi/pull/338
* Add `include_alternative_mfu_metrics` param to control if HF MFU is enabled by xrdaukar in https://github.com/oumi-ai/oumi/pull/336
* Minor doc formatting updates. by taenin in https://github.com/oumi-ai/oumi/pull/340
* Add 8-bit Adam optimizer to Lema trainer by wizeng23 in https://github.com/oumi-ai/oumi/pull/339
* Enable gradient scaling for fp16 mixed-precision training by wizeng23 in https://github.com/oumi-ai/oumi/pull/342
* Add a link to our documentation via the readme. by taenin in https://github.com/oumi-ai/oumi/pull/344
* Disable weight decay for layernorm/biases in Lema trainer by wizeng23 in https://github.com/oumi-ai/oumi/pull/341
* Polaris: Enable NCCL debug logging at WARNING level by xrdaukar in https://github.com/oumi-ai/oumi/pull/347
* Add a new notebook for getting started. by taenin in https://github.com/oumi-ai/oumi/pull/345
* Create `TelemetryCallback` by xrdaukar in https://github.com/oumi-ai/oumi/pull/343
* Various improvements for our autogenerated docs by taenin in https://github.com/oumi-ai/oumi/pull/349
* Polaris: update sample `tail` command to use `-n200` by xrdaukar in https://github.com/oumi-ai/oumi/pull/348
* Fix a minor bug in `TelemetryCallback.on_train_end` by xrdaukar in https://github.com/oumi-ai/oumi/pull/350
* Update LEMA training loop to log wandb url by xrdaukar in https://github.com/oumi-ai/oumi/pull/351
* Update model dtype for DeepSpeed to make it work with SkyPilot and Polaris by xrdaukar in https://github.com/oumi-ai/oumi/pull/352
* Enable the launcher via the CLI by taenin in https://github.com/oumi-ai/oumi/pull/353
* Update Polaris init script to print nodelist by xrdaukar in https://github.com/oumi-ai/oumi/pull/354
* Minor logging updates in Polaris scripts by xrdaukar in https://github.com/oumi-ai/oumi/pull/355
* Define `ddp1gpu` Polaris mode: Spawn 1 `torchrun` process per GPU (4 `torchrun`-s per node) by xrdaukar in https://github.com/oumi-ai/oumi/pull/356
* Add a helper util to query GPU temperatures by xrdaukar in https://github.com/oumi-ai/oumi/pull/359
* Add Llama 8B config by wizeng23 in https://github.com/oumi-ai/oumi/pull/358
* Add another `barrier()` call before train() by xrdaukar in https://github.com/oumi-ai/oumi/pull/360
* Add Llama70B FSDP config by wizeng23 in https://github.com/oumi-ai/oumi/pull/361
* Minor improvements in logging and instrumentations in `train.py` by xrdaukar in https://github.com/oumi-ai/oumi/pull/362
* Refactor our core directory to logically organize our classes. by taenin in https://github.com/oumi-ai/oumi/pull/357
* Basic plumbing for GPU temperature telemetry by xrdaukar in https://github.com/oumi-ai/oumi/pull/363
* Minor update to Llama70B by wizeng23 in https://github.com/oumi-ai/oumi/pull/365
* Reorder model compilation and DDP/FSDP wrapping by xrdaukar in https://github.com/oumi-ai/oumi/pull/364
* Mini tutorial for Llama3.1-70b inference on Polaris. by taenin in https://github.com/oumi-ai/oumi/pull/367
* jgreer013/vllm-inference by jgreer013 in https://github.com/oumi-ai/oumi/pull/366
* Fix interpolation when using the launcher CLI for various sky configs. by taenin in https://github.com/oumi-ai/oumi/pull/369
* Add Llama8B Lora config for GCP/Polaris by wizeng23 in https://github.com/oumi-ai/oumi/pull/368
* Add vllm parallel inference to improve throughput by jgreer013 in https://github.com/oumi-ai/oumi/pull/370
* Set `TOKENIZERS_PARALLELISM: false` for llama8b model by xrdaukar in https://github.com/oumi-ai/oumi/pull/371
* Disable MFU computation for PEFT by xrdaukar in https://github.com/oumi-ai/oumi/pull/372
* Add `empty_device_cache_steps` param and configure it for Llama8b model by xrdaukar in https://github.com/oumi-ai/oumi/pull/373
* Add `TelemetryCallback.include_timer_metrics` param: `False` by default by xrdaukar in https://github.com/oumi-ai/oumi/pull/378
* Update llama8b GCP launcher script to allow Spot VMs by xrdaukar in https://github.com/oumi-ai/oumi/pull/380
* Minimal Llama8B LoRA eval config by xrdaukar in https://github.com/oumi-ai/oumi/pull/376
* Add Llama 8b SFT config by wizeng23 in https://github.com/oumi-ai/oumi/pull/379
* Move common NCCL variables initialization into `polaris_init.sh` by xrdaukar in https://github.com/oumi-ai/oumi/pull/377
* Minor tuning of llama8b configs by xrdaukar in https://github.com/oumi-ai/oumi/pull/382
* Update eval script to use `Meta-Llama-3.1-8B-Instruct` model version by xrdaukar in https://github.com/oumi-ai/oumi/pull/381
* Initial notebook for llama 8b LoRA tuning. by taenin in https://github.com/oumi-ai/oumi/pull/374
* Update SkyPilot GCP script to download the right model version by xrdaukar in https://github.com/oumi-ai/oumi/pull/385
* Clean up Sky configs by wizeng23 in https://github.com/oumi-ai/oumi/pull/383
* Update main makefile to generate docs by oelachqar in https://github.com/oumi-ai/oumi/pull/386
* Add docs-serve makefile command by oelachqar in https://github.com/oumi-ai/oumi/pull/387
* Fix missing new line at the end of `Makefile` by xrdaukar in https://github.com/oumi-ai/oumi/pull/390
* Raise `NOT_IMPLEMENTED` if `adapter_model` is configured for `LM_HARNESS` eval by xrdaukar in https://github.com/oumi-ai/oumi/pull/391
* Update Llama8B LoRA eval script to use built-in LEMA evaluator by xrdaukar in https://github.com/oumi-ai/oumi/pull/389
* Add Llama 70b lora config by wizeng23 in https://github.com/oumi-ai/oumi/pull/388
* Enable markdown docs by oelachqar in https://github.com/oumi-ai/oumi/pull/394
* Check ignored docstring rules by oelachqar in https://github.com/oumi-ai/oumi/pull/395
* Remove special case for saving PEFT models by xrdaukar in https://github.com/oumi-ai/oumi/pull/384
* Move shared code into polaris_init by wizeng23 in https://github.com/oumi-ai/oumi/pull/392
* Update Llama notebook to include 8B SFT by wizeng23 in https://github.com/oumi-ai/oumi/pull/393
* Update sample commands to point to the preemptable queue by taenin in https://github.com/oumi-ai/oumi/pull/396
* Update lm_harness to support LoRA adapters by jgreer013 in https://github.com/oumi-ai/oumi/pull/397
* Fix FSDP model initialization by wizeng23 in https://github.com/oumi-ai/oumi/pull/398
* Add vscode launch config for accelerate distributed training by oelachqar in https://github.com/oumi-ai/oumi/pull/400
* Update trainer save model by oelachqar in https://github.com/oumi-ai/oumi/pull/399
* Increase from 2 to 3 nodes for Llama 70B Lora by wizeng23 in https://github.com/oumi-ai/oumi/pull/402
* Add param to customize NCCL timeout by oelachqar in https://github.com/oumi-ai/oumi/pull/401
* Add docs and gpu install targets by oelachqar in https://github.com/oumi-ai/oumi/pull/403
* Significant improvements for the Polaris launcher by taenin in https://github.com/oumi-ai/oumi/pull/404
* Ensure that jobs are queued on existing clusters when users call UP by taenin in https://github.com/oumi-ai/oumi/pull/406
* Autostop sky clusters after 30 min of no activity by taenin in https://github.com/oumi-ai/oumi/pull/407
* Add support for triton kernels from Liger Kernel by oelachqar in https://github.com/oumi-ai/oumi/pull/405
* Add support for including notebooks in the docs by oelachqar in https://github.com/oumi-ai/oumi/pull/408
* Update sphinx comments to docstrings by oelachqar in https://github.com/oumi-ai/oumi/pull/411
* Add missing docstrings to TrainingParams by oelachqar in https://github.com/oumi-ai/oumi/pull/409
* Capped model max length for Llama tuning by wizeng23 in https://github.com/oumi-ai/oumi/pull/413
* Fix a deadlock in the Polaris launcher for users with 500+ jobs. by taenin in https://github.com/oumi-ai/oumi/pull/412
* Script to run inference with Llama/GPT judges. by kaisopos in https://github.com/oumi-ai/oumi/pull/414
* Add missing docstrings to top-level configs by oelachqar in https://github.com/oumi-ai/oumi/pull/410
* [tiny] sphinx conf update by oelachqar in https://github.com/oumi-ai/oumi/pull/416
* Improve launcher polling by running tasks in a subprocess. by taenin in https://github.com/oumi-ai/oumi/pull/417
* Add missing package docstrings by oelachqar in https://github.com/oumi-ai/oumi/pull/415
* [tiny] Enable D104 rule by oelachqar in https://github.com/oumi-ai/oumi/pull/419
* Fix bug with 70B Lora by wizeng23 in https://github.com/oumi-ai/oumi/pull/421
* Update the CLI to look for open SSH tunnels as a way of preserving Polaris state by taenin in https://github.com/oumi-ai/oumi/pull/418
* Update the polaris launcher to always update the lema installation on job creation. by taenin in https://github.com/oumi-ai/oumi/pull/422
* Cleanup doc RSTs by oelachqar in https://github.com/oumi-ai/oumi/pull/420
* Add sphinx api doc template for packages by oelachqar in https://github.com/oumi-ai/oumi/pull/425
* Add automatically generated apidoc RSTs by oelachqar in https://github.com/oumi-ai/oumi/pull/424
* [tiny] Move apidocs into their own folder by oelachqar in https://github.com/oumi-ai/oumi/pull/426
* Add docs-rebuild command to Makefile by oelachqar in https://github.com/oumi-ai/oumi/pull/427
* Refresh markdown docs by oelachqar in https://github.com/oumi-ai/oumi/pull/429
* Reorganize our test structure by taenin in https://github.com/oumi-ai/oumi/pull/431
* Add Llama 70B SFT config by wizeng23 in https://github.com/oumi-ai/oumi/pull/428
* Script to generate judge prompts. by kaisopos in https://github.com/oumi-ai/oumi/pull/423
* [tiny] Breakdown main Readme into multiple docs by oelachqar in https://github.com/oumi-ai/oumi/pull/430
* Update main readme file by oelachqar in https://github.com/oumi-ai/oumi/pull/432
* Add GitHub badges, readme typos by oelachqar in https://github.com/oumi-ai/oumi/pull/434
* Fix markdown lint errors by oelachqar in https://github.com/oumi-ai/oumi/pull/433
* Update documentation index by oelachqar in https://github.com/oumi-ai/oumi/pull/436
* [tiny] Only log to console on global leader by wizeng23 in https://github.com/oumi-ai/oumi/pull/435
* Tune sphinx config by oelachqar in https://github.com/oumi-ai/oumi/pull/437
* Enable Liger for Llama 8B SFT by wizeng23 in https://github.com/oumi-ai/oumi/pull/439
* Updated Parallel Inference job by jgreer013 in https://github.com/oumi-ai/oumi/pull/438
* Add a mkdir to polaris init. by taenin in https://github.com/oumi-ai/oumi/pull/440
* [tiny] Fix lema loop performance gap by oelachqar in https://github.com/oumi-ai/oumi/pull/441
* [tiny] update trainer benchmark script and minor updates by oelachqar in https://github.com/oumi-ai/oumi/pull/443
* Add Llama 8B eval script by wizeng23 in https://github.com/oumi-ai/oumi/pull/442
* Add dataset remote code param by oelachqar in https://github.com/oumi-ai/oumi/pull/445
* [docs] Update format + add missing docs to data_params.py by oelachqar in https://github.com/oumi-ai/oumi/pull/444
* Update Polaris Llama8b eval script to enable data-parallel evals for LM_HARNESS by xrdaukar in https://github.com/oumi-ai/oumi/pull/446
* Copy changes from PR-446 into Polaris launcher config by xrdaukar in https://github.com/oumi-ai/oumi/pull/448
* Copy changes from PR-446 into GCP launcher config by xrdaukar in https://github.com/oumi-ai/oumi/pull/449
* Minor fixes in llama8B eval scripts by xrdaukar in https://github.com/oumi-ai/oumi/pull/450
* Add Llama 70B eval script by wizeng23 in https://github.com/oumi-ai/oumi/pull/447
* [bugfix] add is_using_accelerate_fsdp util by oelachqar in https://github.com/oumi-ai/oumi/pull/453
* [tiny] Fix inference notebook by wizeng23 in https://github.com/oumi-ai/oumi/pull/451
* Simplify record_function annotation in LEMA training loop by xrdaukar in https://github.com/oumi-ai/oumi/pull/454
* [tiny] enable ruff format on save with notebooks by oelachqar in https://github.com/oumi-ai/oumi/pull/455
* [tiny] Add missing default value to hf_trainer by oelachqar in https://github.com/oumi-ai/oumi/pull/458
* Judge inference script for Polaris by kaisopos in https://github.com/oumi-ai/oumi/pull/452
* Add the base classes for inference. Pull out logic from `infer` to a native text inference engine. by taenin in https://github.com/oumi-ai/oumi/pull/456
* Telemetry improvements for tracking GPU temperature and in general by xrdaukar in https://github.com/oumi-ai/oumi/pull/457
* Add integration tests for native inference (not using the CLI). by taenin in https://github.com/oumi-ai/oumi/pull/460
* Update README.md by mkoukoumidis in https://github.com/oumi-ai/oumi/pull/462
* Update README to make installation steps more prominent by taenin in https://github.com/oumi-ai/oumi/pull/464
* Fix several broken links and update installation instructions by taenin in https://github.com/oumi-ai/oumi/pull/465
* Update inference to pass the generation config to inference engines. by taenin in https://github.com/oumi-ai/oumi/pull/466
* Update README.md by taenin in https://github.com/oumi-ai/oumi/pull/467
* Fixed issue with metadata extraction failure by jgreer013 in https://github.com/oumi-ai/oumi/pull/469
* Add fsdp support to lema loop by oelachqar in https://github.com/oumi-ai/oumi/pull/463
* Combine telemetry from all ranks by xrdaukar in https://github.com/oumi-ai/oumi/pull/468
* Add sample for full fine-tuned and LoRA-tuned model inference using vLLM by wizeng23 in https://github.com/oumi-ai/oumi/pull/470
* Update chat_template_builder by oelachqar in https://github.com/oumi-ai/oumi/pull/472
* Removed duplicate task_done call by jgreer013 in https://github.com/oumi-ai/oumi/pull/473
* Add flag to enable experimental torch data pipes processing pipeline by oelachqar in https://github.com/oumi-ai/oumi/pull/474
* Vision-language datasets & fine-tuning MVP by oelachqar in https://github.com/oumi-ai/oumi/pull/459
* Rebuild docs, add multi-modal tutorial by oelachqar in https://github.com/oumi-ai/oumi/pull/475
* Add test coverage target, update pyproject.toml metadata by oelachqar in https://github.com/oumi-ai/oumi/pull/476
* Create a local inference engine for vLLM by taenin in https://github.com/oumi-ai/oumi/pull/471
* Add llava chat template, QoL improvement to multimodal testing script by oelachqar in https://github.com/oumi-ai/oumi/pull/478
* [Polaris Judge Inference] Adjusting script for Llama 70B quantized by kaisopos in https://github.com/oumi-ai/oumi/pull/461
* Add example for running inference using vLLM on GCP, single-node multi-gpu by oelachqar in https://github.com/oumi-ai/oumi/pull/479
* [tiny] Remove deepspeed from required dependencies by oelachqar in https://github.com/oumi-ai/oumi/pull/482
* Update train path to save meta-info as files under `telemetry` sub-dir by xrdaukar in https://github.com/oumi-ai/oumi/pull/480
* Add inference engine apply_chat_template helper, update example notebook by oelachqar in https://github.com/oumi-ai/oumi/pull/481
* Update arg names for vLLM inference job by wizeng23 in https://github.com/oumi-ai/oumi/pull/477
* Remove device_map for model init from config by wizeng23 in https://github.com/oumi-ai/oumi/pull/484
* Add `log_model_summary` callback by xrdaukar in https://github.com/oumi-ai/oumi/pull/485
* Small typo fix in the vllm notebook by taenin in https://github.com/oumi-ai/oumi/pull/483
* Cleanup FSDP wrap class auto guesser by oelachqar in https://github.com/oumi-ai/oumi/pull/486
* Add missing documentation for model_params by oelachqar in https://github.com/oumi-ai/oumi/pull/487
* Add callback builder function by oelachqar in https://github.com/oumi-ai/oumi/pull/490
* Minor fixes in DISTRIBUTED_TRAINING.md by xrdaukar in https://github.com/oumi-ai/oumi/pull/488
* Switch to using official UV action with dependency caching by oelachqar in https://github.com/oumi-ai/oumi/pull/491
* Introduce `BaseTrainerCallback` alias by xrdaukar in https://github.com/oumi-ai/oumi/pull/492
* Add documentation to peft_params by oelachqar in https://github.com/oumi-ai/oumi/pull/493
* Update `TelemetryCallback` to save final metrics to JSON by xrdaukar in https://github.com/oumi-ai/oumi/pull/494
* Increase the rsync timeout from 40s to 300s by taenin in https://github.com/oumi-ai/oumi/pull/495
* [tiny] fix missing import by oelachqar in https://github.com/oumi-ai/oumi/pull/497
* Rename build_dataset -> build_dataset_mixture by oelachqar in https://github.com/oumi-ai/oumi/pull/498
* Define a simple callback to detect NaN/INF-s during training by xrdaukar in https://github.com/oumi-ai/oumi/pull/496
* Replace `pip install flash-attn` with `.[gpu]` target by wizeng23 in https://github.com/oumi-ai/oumi/pull/502
* Add simpler builder for single dataset use cases by oelachqar in https://github.com/oumi-ai/oumi/pull/499
* Use HF's built-in gradient checkpointing argument by wizeng23 in https://github.com/oumi-ai/oumi/pull/500
* [Draft] Example changes to support 70B single-node inference by jgreer013 in https://github.com/oumi-ai/oumi/pull/503
* Various updates to Llama 2b configs by wizeng23 in https://github.com/oumi-ai/oumi/pull/489
* Add Llama 2B FSDP config by wizeng23 in https://github.com/oumi-ai/oumi/pull/505
* Update `TelemetryCallback` to write JSON with GPU temperature summary by xrdaukar in https://github.com/oumi-ai/oumi/pull/501
* Rename src/lema to src/oumi by wizeng23 in https://github.com/oumi-ai/oumi/pull/506
* OpenAI Chat Engine - Custom servers by taenin in https://github.com/oumi-ai/oumi/pull/504
* Rename configs/lema to configs/oumi by wizeng23 in https://github.com/oumi-ai/oumi/pull/507
* Rename all relevant lema references in codebase by wizeng23 in https://github.com/oumi-ai/oumi/pull/508
* Re-generate Sphinx docs by wizeng23 in https://github.com/oumi-ai/oumi/pull/509
* Update conf.py by taenin in https://github.com/oumi-ai/oumi/pull/510
* Rename remaining lema references in `docs/` by wizeng23 in https://github.com/oumi-ai/oumi/pull/511
* Update final lema references by wizeng23 in https://github.com/oumi-ai/oumi/pull/512
* Update dev setup guide by wizeng23 in https://github.com/oumi-ai/oumi/pull/513
* Update TOTAL_NUM_GPUS compare commands in SkyPilot configs by xrdaukar in https://github.com/oumi-ai/oumi/pull/514
* [Minor] Issues arising from "newcomer" exploration [1/K] by optas in https://github.com/oumi-ai/oumi/pull/518
* Freeze `lm-eval` and `torch` versions as a workaround for OPE-390 by xrdaukar in https://github.com/oumi-ai/oumi/pull/516
* 1. Write wandb telemetry 2. Reorder training callbacks by xrdaukar in https://github.com/oumi-ai/oumi/pull/519
* Multiple updates to Llama 2B by wizeng23 in https://github.com/oumi-ai/oumi/pull/515
* Rename OUMI to Oumi by wizeng23 in https://github.com/oumi-ai/oumi/pull/520
* Add llama.cpp Inference Engine by oelachqar in https://github.com/oumi-ai/oumi/pull/524
* Rename website references to oumi.ai by wizeng23 in https://github.com/oumi-ai/oumi/pull/522
* Add anthropic inference engine by oelachqar in https://github.com/oumi-ai/oumi/pull/523
* Update name typo by oelachqar in https://github.com/oumi-ai/oumi/pull/526
* Add a batch inference job runnable via the Oumi Launcher by taenin in https://github.com/oumi-ai/oumi/pull/527
* Auto-format `pyproject` and `pre-commit` configs by xrdaukar in https://github.com/oumi-ai/oumi/pull/530
* Update Makefile by taenin in https://github.com/oumi-ai/oumi/pull/529
* Fix failing tests after a new install. by taenin in https://github.com/oumi-ai/oumi/pull/531
* Fix a small bug in `infer_interactive()`: only prints the first character by xrdaukar in https://github.com/oumi-ai/oumi/pull/532
* Boosting User-friendliness by optas in https://github.com/oumi-ai/oumi/pull/521
* [tiny] add override from typing_extensions by oelachqar in https://github.com/oumi-ai/oumi/pull/534
* Create CODE_OF_CONDUCT.md by taenin in https://github.com/oumi-ai/oumi/pull/536
* Add conversation helper methods by oelachqar in https://github.com/oumi-ai/oumi/pull/535
* [tiny] cleanup multimodal benchmark script by oelachqar in https://github.com/oumi-ai/oumi/pull/537
* Auto-format shell scripts under `scripts` by xrdaukar in https://github.com/oumi-ai/oumi/pull/539
* Add builder function for data collators by oelachqar in https://github.com/oumi-ai/oumi/pull/538
* Make tokenizer optional by oelachqar in https://github.com/oumi-ai/oumi/pull/540
* Add an optional `-t` flag to scripts/polaris/jobs/llama2b_pt_worker.sh by xrdaukar in https://github.com/oumi-ai/oumi/pull/541
* Fix initial issues found by `shellcheck` by xrdaukar in https://github.com/oumi-ai/oumi/pull/542
* [tiny] fix small typo by oelachqar in https://github.com/oumi-ai/oumi/pull/544
* Minor changes in `scripts/benchmarks/minimal_multimodal_training.py` by xrdaukar in https://github.com/oumi-ai/oumi/pull/543
* [tiny] Add util to get install folder root dir by oelachqar in https://github.com/oumi-ai/oumi/pull/545
* [tiny] Add fp paged_adam optimizer option by oelachqar in https://github.com/oumi-ai/oumi/pull/547
* [tiny] Allow conversation metadata to contain values other than str by oelachqar in https://github.com/oumi-ai/oumi/pull/546
* Switch from Flash Attention 2 to PyTorch SDPA by wizeng23 in https://github.com/oumi-ai/oumi/pull/533
* Use `local_rank` to query GPU temperature by xrdaukar in https://github.com/oumi-ai/oumi/pull/550
* Fix a bug for handling stopped sky clusters in the oumi launcher. by taenin in https://github.com/oumi-ai/oumi/pull/549
* Remove flash attention validation check by wizeng23 in https://github.com/oumi-ai/oumi/pull/551
* Add support for AWS and Azure jobs in Oumi by taenin in https://github.com/oumi-ai/oumi/pull/552
* Pass `split` param to `datasets.load_dataset()` by xrdaukar in https://github.com/oumi-ai/oumi/pull/553
* Implement Judge API MVP by oelachqar in https://github.com/oumi-ai/oumi/pull/548
* Log dataset info: shape, columns, other metainfo by xrdaukar in https://github.com/oumi-ai/oumi/pull/555
* Update experimental pretokenize_dataset tool by xrdaukar in https://github.com/oumi-ai/oumi/pull/554
* Various improvements to Llama eval scripts by wizeng23 in https://github.com/oumi-ai/oumi/pull/556
* Add a couple of `gc.collect()` calls by xrdaukar in https://github.com/oumi-ai/oumi/pull/560
* [tiny] Fix Makefile setup command by wizeng23 in https://github.com/oumi-ai/oumi/pull/561
* Support datasets generated by `dataset.save_to_disk()` by xrdaukar in https://github.com/oumi-ai/oumi/pull/559
* Add support for LoRA adapters in vLLM inference engine by wizeng23 in https://github.com/oumi-ai/oumi/pull/562
* Updates in `VisionLanguageCollator` and in `coco_captions` by xrdaukar in https://github.com/oumi-ai/oumi/pull/563
* Update DEV_SETUP.md with Windows instructions by taenin in https://github.com/oumi-ai/oumi/pull/566
* Make the remote inference engine runnable in jupyter notebooks. by taenin in https://github.com/oumi-ai/oumi/pull/565
* Configure freeze_layer map in `minimal_multimodal_training.py` by xrdaukar in https://github.com/oumi-ai/oumi/pull/569
* Clean up legacy evaluate_oumi code paths by taenin in https://github.com/oumi-ai/oumi/pull/568
* Update model builder to use `default_chat_template` if available by xrdaukar in https://github.com/oumi-ai/oumi/pull/571
* Add package build and deployment workflow to google artifact registry by oelachqar in https://github.com/oumi-ai/oumi/pull/570

New Contributors
* oelachqar made their first contribution in https://github.com/oumi-ai/oumi/pull/1
* kaisopos made their first contribution in https://github.com/oumi-ai/oumi/pull/7
* jgreer013 made their first contribution in https://github.com/oumi-ai/oumi/pull/9
* optas made their first contribution in https://github.com/oumi-ai/oumi/pull/10
* xrdaukar made their first contribution in https://github.com/oumi-ai/oumi/pull/12
* wizeng23 made their first contribution in https://github.com/oumi-ai/oumi/pull/30
* taenin made their first contribution in https://github.com/oumi-ai/oumi/pull/95
* mkoukoumidis made their first contribution in https://github.com/oumi-ai/oumi/pull/462

**Full Changelog**: https://github.com/oumi-ai/oumi/commits/v0.1-alpha
