---
license: apache-2.0
datasets:
- alibayram/hepsiburada_yorumlar
- aliarda/turkish-news-1.8M-tokenized
language:
- tr
base_model:
- aliarda/llama-50M-randParams
- aliarda/llama-50M-latest
---

This is a domain-adaptive pretrained Llama model, made for experimental purposes. You can use the modeling files from [this GitHub repo](https://github.com/ardafincan/LM-playground).

- Model Size: 52,177,152 parameters
- Vocab Size: 32,768
- Context Length: 512
- Embedding Dimension: 256
- Attention Heads: 128
- KV Groups: 64
- Hidden Dimension: 2048
- Number of Layers: 20

The original pretrained model was trained on 1/4 of aliarda/turkish-news-1.8M-tokenized. This model was then further trained on 80% of alibayram/hepsiburada_yorumlar, with the goal of adapting it to another domain.
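
As a sanity check on the hyperparameters listed above, the following sketch recomputes the parameter count from them. It rests on assumptions that this card does not state: a standard Llama-style block (grouped-query attention, a three-matrix SwiGLU feed-forward layer, two RMSNorm layers per block, no bias terms), "KV Groups" read as the number of key/value heads, and an output head that is not tied to the input embedding. The function name `llama_param_count` is hypothetical and is not taken from the linked repo.

```python
def llama_param_count(vocab_size, emb_dim, n_heads, n_kv_groups, hidden_dim, n_layers):
    """Count parameters of an assumed Llama-style decoder (no biases, untied head)."""
    head_dim = emb_dim // n_heads          # 256 // 128 = 2
    kv_dim = n_kv_groups * head_dim        # assumed: one K/V head per KV group

    # Attention: query and output projections are emb_dim x emb_dim,
    # key and value projections are emb_dim x kv_dim.
    attention = 2 * emb_dim * emb_dim + 2 * emb_dim * kv_dim

    # Feed-forward (assumed SwiGLU): gate, up, and down projections.
    feed_forward = 3 * emb_dim * hidden_dim

    # Two RMSNorm weight vectors per block (assumed).
    norms = 2 * emb_dim

    per_layer = attention + feed_forward + norms

    token_embedding = vocab_size * emb_dim
    final_norm = emb_dim
    output_head = vocab_size * emb_dim     # assumed untied from the embedding

    return token_embedding + n_layers * per_layer + final_norm + output_head


total = llama_param_count(
    vocab_size=32_768,
    emb_dim=256,
    n_heads=128,
    n_kv_groups=64,
    hidden_dim=2048,
    n_layers=20,
)
print(f"{total:,}")  # prints 52,177,152
```

Under these assumptions the result matches the stated model size exactly, which suggests (but does not prove) that the checkpoint follows this layout. Note that the two vocabulary-sized matrices account for roughly a third of all parameters at this scale.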