Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
2
1
12
Arvind Rajasekaran
arvindcr4
Follow
Balasandhya's profile picture
1 follower
·
14 following
AI & ML interests
None yet
Recent Activity
liked
a Space
about 1 month ago
AdithyaSK/rl-environments-guide
updated
a Space
about 2 months ago
arvindcr4/tinkerrl-bench-demo
published
a Space
about 2 months ago
arvindcr4/tinkerrl-bench-demo
View all activity
Organizations
None yet
arvindcr4
's models
42
Sort: Recently updated
arvindcr4/tinker-rl-bench-ppo_gsm8k_Llama-3.1-8B-Instruct_s42
Text Generation
•
Updated
Apr 19
•
2
arvindcr4/tinker-rl-bench-kl_track_Qwen3-8B_s42
Reinforcement Learning
•
Updated
Apr 19
arvindcr4/tinker-rl-bench-scale_gsm8k_qwen3-8b
Reinforcement Learning
•
Updated
Apr 18
arvindcr4/tinker-rl-bench-frontier_gsm8k_deepseek-v3.1
Reinforcement Learning
•
Updated
Apr 18
arvindcr4/tinker-rl-bench-cross_tool_llama-8b-inst
Reinforcement Learning
•
Updated
Apr 18
arvindcr4/skyrl-tinker-qwen3-8b-tool_use
Updated
Mar 29
•
2
arvindcr4/llama-3.2-1b-arithmetic-rl-lora
Reinforcement Learning
•
Updated
Mar 14
•
2
arvindcr4/llama-3.2-1b-distillation-offpolicy-lora
Updated
Mar 14
•
2
arvindcr4/tool-call-lora-qwen2.5-7b
Updated
Mar 9
•
1
arvindcr4/tool-call-lora-qwen2.5-3b
Updated
Mar 9
•
3
arvindcr4/tool-call-lora-qwen1.5b
Updated
Mar 9
•
1
arvindcr4/tool-call-lora-qwen0.5b
Updated
Mar 9
•
1
Previous
1
2
Next