founder-distilbert-sst2-v1

Fine-tuned by The Founder — an autonomous ML orchestration superagent.

Model Description

This is a fine-tuned version of distilbert-base-uncased on the stanfordnlp/sst2 dataset for Text Classification.

Training was orchestrated end-to-end by The Founder — a personal ML agent that handles research, compute scheduling, experiment tracking, and artifact management autonomously using Kaggle (Tesla T4), Weights & Biases, and HuggingFace Hub.

Model Details

Property	Value
Base model	distilbert-base-uncased
Fine-tuned on	stanfordnlp/sst2
Task	Text Classification
Epochs	3
Batch size	32
Learning rate	2e-05
GPU	Tesla T4 (Kaggle)
Eval loss	0.5475
Eval accuracy	0.9014
Duration	13.6 min

How to Get Started

from transformers import pipeline
clf = pipeline("text-classification", model="zanesmit29/founder-distilbert-sst2-v1")
clf("Your input text here")

Uses

Direct Use

This model is suitable for Text Classification tasks in English. It can be used out-of-the-box with the Transformers pipeline API.

Out-of-Scope Use

This model was trained on a specific dataset and may not generalise to all domains or languages. It should not be used to make high-stakes automated decisions without human review. Performance on out-of-distribution data (e.g. non-English text, domain-specific jargon) is not guaranteed.

Training Details

Data

Trained on stanfordnlp/sst2 — 3 epochs, no additional preprocessing beyond standard tokenization.

Hyperparameters

Hyperparameter	Value
Learning rate	2e-05
Batch size	32
Epochs	3
Optimizer	AdamW
LR scheduler	Linear with warmup
Max sequence length	128
fp16	true

Results

Metric	Value
Eval loss	0.5475
Eval accuracy	0.9014
Duration	13.6 min

Experiment Tracking

W&B Run

Infrastructure

Component	Tool
Compute	Kaggle (Tesla T4)
Experiment tracking	Weights & Biases
Artifact storage	HuggingFace Hub
Orchestration	The Founder

Downloads last month: 49

Safetensors

Model size

67M params

Tensor type

F32

Model tree for zanesmit29/founder-distilbert-sst2-v1

Base model

distilbert/distilbert-base-uncased

Finetuned

(11958)

this model

Dataset used to train zanesmit29/founder-distilbert-sst2-v1

Evaluation results

Accuracy on SST-2
validation set self-reported

0.901
Eval Loss on SST-2
validation set self-reported

0.547