founder-distilbert-sst2-v1
Fine-tuned by The Founder — an autonomous ML orchestration superagent.
Model Description
This is a fine-tuned version of distilbert-base-uncased on the stanfordnlp/sst2 dataset for Text Classification.
Training was orchestrated end-to-end by The Founder — a personal ML agent that handles research, compute scheduling, experiment tracking, and artifact management autonomously using Kaggle (Tesla T4), Weights & Biases, and HuggingFace Hub.
Model Details
| Property | Value |
|---|---|
| Base model | distilbert-base-uncased |
| Fine-tuned on | stanfordnlp/sst2 |
| Task | Text Classification |
| Epochs | 3 |
| Batch size | 32 |
| Learning rate | 2e-05 |
| GPU | Tesla T4 (Kaggle) |
| Eval loss | 0.5475 |
| Eval accuracy | 0.9014 |
| Duration | 13.6 min |
How to Get Started
from transformers import pipeline
clf = pipeline("text-classification", model="zanesmit29/founder-distilbert-sst2-v1")
clf("Your input text here")
Uses
Direct Use
This model is suitable for Text Classification tasks in English. It can be used out-of-the-box with the Transformers pipeline API.
Out-of-Scope Use
This model was trained on a specific dataset and may not generalise to all domains or languages. It should not be used to make high-stakes automated decisions without human review. Performance on out-of-distribution data (e.g. non-English text, domain-specific jargon) is not guaranteed.
Training Details
Data
Trained on stanfordnlp/sst2 — 3 epochs, no additional preprocessing beyond standard tokenization.
Hyperparameters
| Hyperparameter | Value |
|---|---|
| Learning rate | 2e-05 |
| Batch size | 32 |
| Epochs | 3 |
| Optimizer | AdamW |
| LR scheduler | Linear with warmup |
| Max sequence length | 128 |
| fp16 | true |
Results
| Metric | Value |
|---|---|
| Eval loss | 0.5475 |
| Eval accuracy | 0.9014 |
| Duration | 13.6 min |
Experiment Tracking
Infrastructure
| Component | Tool |
|---|---|
| Compute | Kaggle (Tesla T4) |
| Experiment tracking | Weights & Biases |
| Artifact storage | HuggingFace Hub |
| Orchestration | The Founder |
- Downloads last month
- 49
Model tree for zanesmit29/founder-distilbert-sst2-v1
Base model
distilbert/distilbert-base-uncasedDataset used to train zanesmit29/founder-distilbert-sst2-v1
Evaluation results
- Accuracy on SST-2validation set self-reported0.901
- Eval Loss on SST-2validation set self-reported0.547