Safetensors
llama
dyogatama commited on
Commit
75c9114
·
verified ·
1 Parent(s): 176ffe9

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +5 -8
README.md CHANGED
@@ -3,18 +3,15 @@ license: apache-2.0
3
  ---
4
 
5
  # Reka Flash 3.1
6
- Reka Flash 3.1 is a 21B general-purpose reasoning model that was trained from scratch. It was trained in synthetic and public datasets for supervised finetuning, followed by large-scale RLOO with rule-based rewards. Reka Flash 3.1 is an improved version of Reka Flash 3 due to significant advances in our reinforcement learning stack and curated high-qaulity RL data. Reka Flash 3.1 is particularly strong on coding and as a base model to be finetuned on agentic tasks.
7
- Reka Flash 3.1 improves by 10 points on LiveCodeBench v5 (Full set) from Reka Flash 3. For coding related tasks, Reka Flash 3.1 is competitive with models such as Qwen3-32B. o3-mini, and Gemini 2.5 Flash Thinking. If you want to learn more about how we do reinforcement learning for Reka Flash 3.1 that results in these improvements, please check out this post.
 
8
 
9
  ![image/png](https://cdn-uploads.huggingface.co/production/uploads/65a776598ee30c06716aa380/e5SsUG4vTgslFtcJxM9DT.png)
10
 
11
- Try it out at [Reka Space](https://space.reka.ai).
12
-
13
- Strong reasoning and coding skills are important capabilities to support multimodal agentic use cases, and near-lossless quantization allows us to deploy our models anywhere. A multimodal version of Reka-Flash-3.1 serves as a base model for our core products Reka Research and Reka Vision. Please contact us for more information about how you can use them in your organizations.
14
-
15
- Model efficiency is critical for the local deployment. We also release a quantized version of Reka Flash 3.1 in this link. Meanwhile, we opensource the corresponding quantizatioon library at this link.
16
-
17
 
 
18
 
19
 
20
  ## Quickstart
 
3
  ---
4
 
5
  # Reka Flash 3.1
6
+ Reka Flash 3.1 is an update to our Reka Flash 3. It is particularly strong on coding and as a base model to be finetuned on agentic tasks. Reka Flash 3.1 was post trained with synthetic and public datasets for supervised finetuning, followed by large-scale reinforcement learning (RLOO) with verifiable rewards.
7
+ It improves by 10 points on LiveCodeBench v5 (Full set) from Reka Flash 3 due to significant advances in our reinforcement learning stack. For coding related tasks, Reka Flash 3.1 is competitive with models such as Qwen3-32B. o3-mini, and Gemini 2.5 Flash Thinking.
8
+ If you want to learn more about how we do reinforcement learning for Reka Flash 3.1 that results in these improvements, please check out this post.
9
 
10
  ![image/png](https://cdn-uploads.huggingface.co/production/uploads/65a776598ee30c06716aa380/e5SsUG4vTgslFtcJxM9DT.png)
11
 
12
+ Try it out at our playground (https://app.reka.ai).
 
 
 
 
 
13
 
14
+ We also release a 3.5 bit quantized version of Reka Flash 3.1 and our quantizatioon library to better support local deployment.
15
 
16
 
17
  ## Quickstart