LLM Adaptation to Czech Language
Collection
This collection accompanies the master's thesis on Compute-constrained LLM adaptation to Czech language. Available from: TBA.
β’
12 items
β’
Updated
Llama 3.1 8B continuously pretrained on a mixture of FineWeb2 and FineWeb-Edu datasets and instsruction-tuned using a mixture of English and Czech Alpaca and Dolly datasets. More information in the thesis: TBA. (The notation is thesis is: B->CP_(cs+en)+IT_(cs+en (DA)))
This model is a Czech-adapted version of Meta's LLaMA 3.1 8B, developed as part of master's thesis. It is intended solely for academic and research purposes.
Researchers and practitioners using this model must ensure appropriate ethical oversight and conduct rigorous evaluations before any further deployment or fine-tuning.
TBA