File size: 2,832 Bytes
6850319 8a6e9c5 6850319 83dc8b2 071ae7d e5a0935 83dc8b2 eb94555 83dc8b2 eb94555 83dc8b2 0747168 83dc8b2 071ae7d 83dc8b2 7a9282f 83dc8b2 7238c62 95cd2f8 | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 | ---
license: apache-2.0
datasets:
- openbmb/Ultra-FineWeb
- Alic-Li/Translate_datasets
- Alic-Li/jp_zh_translate_datasets
language:
- en
- zh
base_model:
- BlinkDL/rwkv7-g1
pipeline_tag: translation
tags:
- RWKV_V7
- Englisg->Chinese
- 0.4B
- 1.5B
---
## π§ Model Overview
- This project provides an **English-to-Chinese translation model** based on the **RWKV-V7 architecture**, with approximately **0.4 billion parameters** and **1.5 billion parameters**.
- Model weight fine-tuning base on [https://huggingface.co/BlinkDL/rwkv7-g1](https://huggingface.co/BlinkDL/rwkv7-g1)
- The model has been fully fine-tuned on translation tasks and demonstrates strong performance across various domains, especially in handling long sentences, technical terminology, and culturally nuanced expressions.
- Unlike traditional Transformer-based models, RWKV combines the sequential state-passing mechanism of RNNs with the parallel training capabilities of Transformers. This unique design enables efficient inference while maintaining powerful sequence modeling abilities, making it ideal for deployment on resource-constrained environments such as mobile devices, embedded systems, or edge computing platforms.
### π¦ Install Dependencies
#### π’ For Nvidia CUDA
```bash
pip install torch rwkv gradio
```
#### π΄ For AMD ROCm
- set ```os.environ["RWKV_CUDA_ON"] = '0' ```
```bash
pip install torch --index-url https://download.pytorch.org/whl/rocm6.3
pip install rwkv gradio
```
### π Run The demo
- Change line 20 in ```webui_new.py``` to you own model weights path
```bash
python webui_new.py
```
## β οΈ Notice
~~- This model currently supports **English β Chinese** translation only.~~
- Now it support **English β Chinese** & **Chinese β English** ~~~
## π‘ Key Advantages
- β
**Lightweight and Deployment-Friendly**: Achieves high-quality translation with only 0.4B / 1.5B parameters.
- β
**Strong Long-Context Modeling**: Supports input lengths up to 4096 tokens.
- β
**Low Memory Footprint**: Ideal for edge devices, mobile apps, and embedded systems.
- β
**Multilingual Potential**: Built upon a multilingual pre-training foundation, future versions may support more language pairs.
## π Recommended Resources
- π [Official RWKV Repo](https://github.com/BlinkDL/RWKV-LM)
- π§ͺ [Official RWKV Website](https://www.rwkv.cn/)
- π§° [Official RWKV project collection](https://github.com/RWKV-Vibe)
- π¦ [Official fine-tuning Repo](https://github.com/JL-er/RWKV-PEFT)
- π€ [RWKV Runner](https://github.com/josStorer/RWKV-Runner)
- π [AI00 Web Server](https://github.com/Ai00-X/ai00_server)
## π§© Developer Info
- **Developer**: Alic Li
- **GitHub**: [https://github.com/Alic-Li](https://github.com/Alic-Li)
- **Contact**: [alic2591709191@gmail.com](alic2591709191@gmail.com) |