Open to Collab

Hanshen Zhu

CIawevy

AI & ML interests

Controllable Generation, Text Rendering, Large Language Model, Agent Modeling

Organizations

VLRLab-OCR

upvoted a paper 3 months ago

OmniPSD: Layered PSD Generation with Diffusion Transformer

Paper • 2512.09247 • Published Dec 10, 2025 • 51
upvoted a collection 3 months ago

FreeFine

Collection
Training-Free Diffusion for Geometric Image Editing • 3 items • Updated Feb 17 • 1
upvoted 2 papers 3 months ago

MinerU-Diffusion: Rethinking Document OCR as Inverse Rendering via Diffusion Decoding

Paper • 2603.22458 • Published Mar 23 • 137

TextPecker: Rewarding Structural Anomaly Quantification for Enhancing Visual Text Rendering

Paper • 2602.20903 • Published Feb 24 • 1
upvoted a collection 4 months ago

TextPecker

Collection
Rewarding Structural Anomaly Quantification for Enhancing Visual Text Rendering • 8 items • Updated Feb 25 • 1
upvoted 2 papers 4 months ago

SemiETS: Integrating Spatial and Content Consistencies for Semi-Supervised End-to-end Text Spotting

Paper • 2504.09966 • Published Apr 14, 2025 • 1

Training-free Geometric Image Editing on Diffusion Models

Paper • 2507.23300 • Published Jul 31, 2025 • 2