Safetensors
Turkish
aliarda's picture
Update README.md
7bca2e6 verified
|
Raw
History Blame Contribute Delete
754 Bytes
---
license: apache-2.0
datasets:
- alibayram/hepsiburada_yorumlar
- aliarda/turkish-news-1.8M-tokenized
language:
- tr
base_model:
- aliarda/llama-50M-randParams
- aliarda/llama-50M-latest
---
This is a Domain Adaptive PreTrained llama model. Made for experimental purposes.
You can use modeling files from [this GitHub repo](https://github.com/ardafincan/LM-playground).
- Model Size: 52,177,152
- Vocab Size: 32,768
- Context Length: 512
- Embedding Dimension: 256
- Attention Heads: 128
- KV Groups: 64
- Hidden Dimension: 2048
- Number of Layers: 20
Original pretrained model is trained on 1/4 of aliarda/turkish-news-1.8M-tokenized.
This model is then trained on 80% of alibayram/hepsiburada_yorumlar with the goal of adapting to another domain.