Commit History
Fix falcon tokenization step (#1441) [skip ci]
		bcdc9b1
	turn sample_packing on for training (#1438) [skip ci]
		c19d060
	make sure to capture non-null defaults from config validation (#1415)
		601b77b
	fix(dataset): normalize tokenizer config and change hash from tokenizer class to tokenizer path (#1298)
		ff939d8
	docs: update link to docs of advance topic in README.md (#1437)
		324d59e
	chore(config): refactor old mistral config (#1435)
		f1ebaa0
	Fix ORPO multi gpu (#1433)
		34ba634
	Update docs.yml
		4e69aa4
	Bootstrap Hosted Axolotl Docs w/Quarto (#1429)
		629450c
	strip out hacky qlora-fsdp workarounds now that qlora-fsdp fixes are upstreamed (#1428)
		2a1589f
	HF / FEAT: Optimize HF tags (#1425) [skip ci]
		7d55607
	fixes for dpo and orpo template loading (#1424)
		7803f09
	support galore once upstreamed into transformers (#1409)
		dd449c5
	Feat: Add sharegpt multirole (#1137)
		40a88e8
	fix(config): passing gradient_checkpoint_kwargs (#1412)
		b1e3e1b
	ORPO (#1419)
		2ea70eb
	Update README.md (#1418)
		e8c8ea6
	
		jbl
	committed on
chore(script): remove redundant setting (#1411)
		d485a08
	Fix(readme): Improve README QuickStart info (#1408)
		f083aed
	Feat(readme): Add instructions for Google GPU VM instances (#1410)
		868c339
	beta support for multipack with gemmoe: (#1402)
		8df7b88
	Fix Gemma 7b qlora.yml (#1405)
		6366b0c
	Train parameters exclusively in specific ranges (#1390)
		05bcc9e
	Don't disable existing loggers when configuring axolotl logging (#1395)
		3bd8203
	Add QLoRA + FSDP Docs (#1403)
		8b12468
	Update ChatTemplate enum to include alpaca and gemma (#1396)
		0976781
	add handling for argilla dpo-mix (#1397)
		8a82d2e
	chore: lint (#1389)
		4326520
	Add Glaive conversation format support (#1365)
		b7d8a7d
	Set `gradient_clipping` to `auto` in DeepSpeed configs (#1382) [skip ci]
		b0ee9ec
	support for rslora (#1387) [skip ci]
		7659c00
	validation for fsdp and deepspeed (#1388) [skip ci]
		3fd8093
	FDSP + QLoRA (#1378)
		9b6ee83
	JarvisLabs (#1372)
		638c2da
	update flash attention for gemma support: (#1368)
		58b0d4b
	add docs for `input_output` format (#1367) [skip ci]
		ed70a08
	support for DoRA w/ PEFT (#1363)
		0cfdb2c
	Remove unsupported python version 3.9 from README (#1364) [skip ci]
		3765747
	
		Nicolas Rojas
	committed on