Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
InfoBayAI 's Collections
Thesis Dataset
Longitudinal Time Series Dataset
Codebase Datasets
Healthcare AI Datasets for Clinical & LLM Training
Dual Channel Global Customer-Agent Interaction Datasets
SIngle-Channel-Call-Center-Audio-Dataset
Dual-channel Podcast Speech Audio Datasets
Single-channel Podcast Speech Audio Datasets
UGC and STEM Video Datasets
Academic Textbook Corpora for LLM Training
STEM & Non-STEM Q&A Datasets for LLM Training
Computer Vision & Multimodal Datasets
Egocentric videos

STEM & Non-STEM Q&A Datasets for LLM Training

updated 11 days ago

Sample datasets from a 6.5M+ enterprise-grade Q&A corpus across STEM and Non-STEM domains, built for LLM training, instruction tuning, and evaluation.

Upvote
1

  • InfoBayAI/Hindi-STEM-QA-MCQ-Dataset

    Viewer • Updated 14 days ago • 200 • 67

  • InfoBayAI/English-STEM-QA-MCQ-Dataset

    Viewer • Updated 14 days ago • 320 • 60

  • InfoBayAI/English-Non-STEM-QA-MCQ-Dataset

    Viewer • Updated 14 days ago • 50 • 57

  • InfoBayAI/Arabic-STEM-QA-MCQ-Dataset

    Viewer • Updated 14 days ago • 49 • 61

  • InfoBayAI/Arabic-Non-STEM-QA-MCQ-Dataset

    Viewer • Updated 14 days ago • 44 • 53

  • InfoBayAI/Hindi-Non-STEM-QA-MCQ-Dataset

    Viewer • Updated 14 days ago • 50 • 43
Upvote
1
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs