Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
minhnguyent546 's Collections
e2026
cotu-legal-retriever
[model] Machine Translation Models
[dataset] image-text datasets
[dataset] embeddings-and-retrieval-learning
[dataset] text-generation
[model] embeddings
ViCLIP-OT
Med-Alpaca

[dataset] text-generation

updated Apr 10
Upvote
-

  • almanach/halvest

    Updated 21 days ago • 959 • 3

  • uonlp/CulturaX

    Viewer • Updated Dec 16, 2024 • 7.18B • 22.4k • 638

  • HuggingFaceFW/fineweb

    Viewer • Updated Jul 11, 2025 • 52.5B • 602k • 2.88k

  • HuggingFaceFW/fineweb-2

    Viewer • Updated Oct 27, 2025 • 4.48B • 62k • 817

  • HuggingFaceFW/finepdfs

    Viewer • Updated Apr 3 • 476M • 20.3k • 876

  • HuggingFaceFW/finewiki

    Viewer • Updated Oct 22, 2025 • 61.6M • 9.43k • 303

  • HuggingFaceFW/finetranslations

    Viewer • Updated Jan 9 • 3.33B • 10.8k • 294

  • Viet-Mistral/CulturaY

    Viewer • Updated Mar 30, 2024 • 1.14B • 20k • 40

  • ronantakizawa/github-top-code

    Viewer • Updated Feb 23 • 1.12M • 383 • 123

  • allenai/c4

    Viewer • Updated Jan 9, 2024 • 10.4B • 845k • 592

  • aisingapore/SEA-PILE-v1

    Viewer • Updated Dec 2, 2025 • 636M • 289 • 18
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs