Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up

DavidAU
/
Qwen3.5-40B-Claude-4.5-Opus-High-Reasoning-Thinking

Image-Text-to-Text
Transformers
Safetensors
English
Chinese
unsloth
fine tune
all use cases
coder
creative
creative writing
fiction writing
plot generation
sub-plot generation
story generation
scene continue
storytelling
fiction story
science fiction
romance
all genres
story
writing
vivid prosing
vivid writing
fiction
roleplaying
bfloat16
Model card Files Files and versions
xet
Community
7

Instructions to use DavidAU/Qwen3.5-40B-Claude-4.5-Opus-High-Reasoning-Thinking with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

  • Libraries
  • Transformers

    How to use DavidAU/Qwen3.5-40B-Claude-4.5-Opus-High-Reasoning-Thinking with Transformers:

    # Use a pipeline as a high-level helper
    from transformers import pipeline
    
    pipe = pipeline("image-text-to-text", model="DavidAU/Qwen3.5-40B-Claude-4.5-Opus-High-Reasoning-Thinking")
    # Load model directly
    from transformers import AutoModel
    model = AutoModel.from_pretrained("DavidAU/Qwen3.5-40B-Claude-4.5-Opus-High-Reasoning-Thinking", dtype="auto")
  • Notebooks
  • Google Colab
  • Kaggle
  • Local Apps Settings
  • vLLM

    How to use DavidAU/Qwen3.5-40B-Claude-4.5-Opus-High-Reasoning-Thinking with vLLM:

    Install from pip and serve model
    # Install vLLM from pip:
    pip install vllm
    # Start the vLLM server:
    vllm serve "DavidAU/Qwen3.5-40B-Claude-4.5-Opus-High-Reasoning-Thinking"
    # Call the server using curl (OpenAI-compatible API):
    curl -X POST "http://localhost:8000/v1/completions" \
    	-H "Content-Type: application/json" \
    	--data '{
    		"model": "DavidAU/Qwen3.5-40B-Claude-4.5-Opus-High-Reasoning-Thinking",
    		"prompt": "Once upon a time,",
    		"max_tokens": 512,
    		"temperature": 0.5
    	}'
    Use Docker
    docker model run hf.co/DavidAU/Qwen3.5-40B-Claude-4.5-Opus-High-Reasoning-Thinking
  • SGLang

    How to use DavidAU/Qwen3.5-40B-Claude-4.5-Opus-High-Reasoning-Thinking with SGLang:

    Install from pip and serve model
    # Install SGLang from pip:
    pip install sglang
    # Start the SGLang server:
    python3 -m sglang.launch_server \
        --model-path "DavidAU/Qwen3.5-40B-Claude-4.5-Opus-High-Reasoning-Thinking" \
        --host 0.0.0.0 \
        --port 30000
    # Call the server using curl (OpenAI-compatible API):
    curl -X POST "http://localhost:30000/v1/completions" \
    	-H "Content-Type: application/json" \
    	--data '{
    		"model": "DavidAU/Qwen3.5-40B-Claude-4.5-Opus-High-Reasoning-Thinking",
    		"prompt": "Once upon a time,",
    		"max_tokens": 512,
    		"temperature": 0.5
    	}'
    Use Docker images
    docker run --gpus all \
        --shm-size 32g \
        -p 30000:30000 \
        -v ~/.cache/huggingface:/root/.cache/huggingface \
        --env "HF_TOKEN=<secret>" \
        --ipc=host \
        lmsysorg/sglang:latest \
        python3 -m sglang.launch_server \
            --model-path "DavidAU/Qwen3.5-40B-Claude-4.5-Opus-High-Reasoning-Thinking" \
            --host 0.0.0.0 \
            --port 30000
    # Call the server using curl (OpenAI-compatible API):
    curl -X POST "http://localhost:30000/v1/completions" \
    	-H "Content-Type: application/json" \
    	--data '{
    		"model": "DavidAU/Qwen3.5-40B-Claude-4.5-Opus-High-Reasoning-Thinking",
    		"prompt": "Once upon a time,",
    		"max_tokens": 512,
    		"temperature": 0.5
    	}'
  • Unsloth Studio

    How to use DavidAU/Qwen3.5-40B-Claude-4.5-Opus-High-Reasoning-Thinking with Unsloth Studio:

    Install Unsloth Studio (macOS, Linux, WSL)
    curl -fsSL https://unsloth.ai/install.sh | sh
    # Run unsloth studio
    unsloth studio -H 0.0.0.0 -p 8888
    # Then open http://localhost:8888 in your browser
    # Search for DavidAU/Qwen3.5-40B-Claude-4.5-Opus-High-Reasoning-Thinking to start chatting
    Install Unsloth Studio (Windows)
    irm https://unsloth.ai/install.ps1 | iex
    # Run unsloth studio
    unsloth studio -H 0.0.0.0 -p 8888
    # Then open http://localhost:8888 in your browser
    # Search for DavidAU/Qwen3.5-40B-Claude-4.5-Opus-High-Reasoning-Thinking to start chatting
    Using HuggingFace Spaces for Unsloth
    # No setup required
    # Open https://huggingface.co/spaces/unsloth/studio in your browser
    # Search for DavidAU/Qwen3.5-40B-Claude-4.5-Opus-High-Reasoning-Thinking to start chatting
    Load model with FastModel
    pip install unsloth
    from unsloth import FastModel
    model, tokenizer = FastModel.from_pretrained(
        model_name="DavidAU/Qwen3.5-40B-Claude-4.5-Opus-High-Reasoning-Thinking",
        max_seq_length=2048,
    )
  • Docker Model Runner

    How to use DavidAU/Qwen3.5-40B-Claude-4.5-Opus-High-Reasoning-Thinking with Docker Model Runner:

    docker model run hf.co/DavidAU/Qwen3.5-40B-Claude-4.5-Opus-High-Reasoning-Thinking
New discussion
Resources
  • PR & discussions documentation
  • Code of Conduct
  • Hub documentation

Gratitude and a question

2
#7 opened about 1 month ago by
Lintrarius

We uploaded a Opus 4.6 dataset you should check it out!

🔥👍 1
#6 opened 3 months ago by
EclipseMist

Please try out the template I provided; it can best mimic Claude's way of thinking.

3
#5 opened 3 months ago by
win10

Modified chat template in llama.cpp

2
#4 opened 3 months ago by
tis111

This is quite good, any chance you can fine tune the MOE 35B 3A one next ?

7
#2 opened 3 months ago by
random-user10

Congrats

2
#1 opened 3 months ago by
singulariti
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs