Wrong output

by KlausRust - opened Oct 16, 2025

Oct 16, 2025

Hi, thanks for sharing this quantized model, but my output may be incorrect.
I used the example below:
conversation = [
    {
        "role": "user",
        "content": [
            {"type": "image", "image": "https://qianwen-res.oss-cn-beijing.aliyuncs.com/Qwen3-Omni/demo/cars.jpg"},
            {"type": "audio", "audio": "https://qianwen-res.oss-cn-beijing.aliyuncs.com/Qwen3-Omni/demo/cough.wav"},
            {"type": "text", "text": "What can you see and hear? Answer in one short sentence."},
        ],
    },
]

I'm running this on my Jetson Orin AGX, and here is the output:
"The image is a collage of repeating text that says "I'm a little teapot," and the audio is a series of short, percussive, clicking sounds."

This seems to be the wrong answer.

lq0104

Nov 3, 2025

Are you using Docker? If so, could you share the image you're using? I was wondering if this might be related to the environment.
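
For comparing environments, something like the following could collect the basic details to post here. This is only a minimal sketch using the Python standard library; the package names in `PACKAGES` are assumptions about what a typical setup for this model has installed, so adjust them to your own.

```python
import platform
import sys
from importlib import metadata

# Assumed package names; change these to match your own setup.
PACKAGES = ["vllm", "torch", "transformers"]


def collect_env(packages=PACKAGES):
    """Return a dict of platform details and installed package versions."""
    info = {
        "python": sys.version.split()[0],
        "machine": platform.machine(),  # CPU architecture, e.g. aarch64 on Jetson
        "system": platform.platform(),
    }
    for name in packages:
        try:
            info[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            # The package is not installed in this Python environment.
            info[name] = "not installed"
    return info


if __name__ == "__main__":
    for key, value in collect_env().items():
        print(f"{key}: {value}")
```

Running the script in the same Python environment used for inference prints one line per item, which makes it easier to spot version differences between two machines.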

KlausRust

Nov 5, 2025

> Are you using Docker? If so, could you share the image you're using? I was wondering if this might be related to the environment.

No, I didn't use Docker.

lq0104

Nov 27, 2025

Can you tell me how to compile vLLM on Jetson Orin AGX? I haven't been successful no matter what I try. Thank you.

KlausRust

Nov 28, 2025

> Can you tell me how to compile vLLM on Jetson Orin AGX? I haven't been successful no matter what I try. Thank you.

I built it the standard way. Perhaps you can post the issues you ran into here.
