founder-distilbert-sst2-v1

Fine-tuned by The Founder — an autonomous ML orchestration superagent.

Model Description

This model is distilbert-base-uncased fine-tuned on the stanfordnlp/sst2 dataset for binary sentiment classification (negative vs. positive) of English sentences.

Training was orchestrated end-to-end by The Founder, a personal ML agent that autonomously handles research, compute scheduling, experiment tracking, and artifact management. The run used Kaggle (Tesla T4) for compute, Weights & Biases for tracking, and the Hugging Face Hub for artifact storage.

Model Details

| Property | Value |
|---|---|
| Base model | distilbert-base-uncased |
| Fine-tuned on | stanfordnlp/sst2 |
| Task | Text Classification |
| Epochs | 3 |
| Batch size | 32 |
| Learning rate | 2e-05 |
| GPU | Tesla T4 (Kaggle) |
| Eval loss | 0.5475 |
| Eval accuracy | 0.9014 |
| Duration | 13.6 min |

How to Get Started

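from transformers import pipeline
# Load the fine-tuned checkpoint from the Hub
clf = pipeline("text-classification", model="zanesmit29/founder-distilbert-sst2-v1")
# Returns a list with one dict per input, each holding a "label" and a "score"
clf("Your input text here")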

Uses

Direct Use

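This model classifies the sentiment of English sentences as negative or positive. Because SST-2 consists of movie-review sentences, it is best suited to short English text of a similar style. It can be used out of the box with the Transformers pipeline API.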
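The pipeline returns one dict per input with a `label` and a `score`. The sketch below shows one way to turn that into readable sentiment names. It assumes the checkpoint emits the generic `LABEL_0`/`LABEL_1` names and that the ids follow the stanfordnlp/sst2 convention (0 = negative, 1 = positive); neither is confirmed by this card, so check the model's `config.json` first. The `readable` helper and the sample values are hypothetical.

```python
# Assumption: generic label names, with ids following the stanfordnlp/sst2
# convention (0 = negative, 1 = positive). Verify against the model config.
SST2_LABELS = {"LABEL_0": "negative", "LABEL_1": "positive"}


def readable(predictions):
    """Map pipeline-style dicts to (sentiment, score) tuples.

    Labels that are not in SST2_LABELS are passed through unchanged.
    """
    return [
        (SST2_LABELS.get(p["label"], p["label"]), round(p["score"], 4))
        for p in predictions
    ]


# Illustrative pipeline-shaped input, NOT real output from this model.
sample = [
    {"label": "LABEL_1", "score": 0.98},
    {"label": "LABEL_0", "score": 0.91},
]
print(readable(sample))
```

In practice, `sample` would be replaced by the list returned from `clf([...])`.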

Out-of-Scope Use

This model was trained on a specific dataset and may not generalise to all domains or languages. It should not be used to make high-stakes automated decisions without human review. Performance on out-of-distribution data (e.g. non-English text, domain-specific jargon) is not guaranteed.
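One way to keep a human in the loop is to auto-accept only confident predictions and queue the rest for review. The following is a minimal sketch of that idea, not part of this model or of Transformers: the `route` helper and the 0.90 threshold are hypothetical, and the sample predictions are illustrative rather than real model output. Note that a high score does not guarantee correctness on out-of-distribution text, so a threshold reduces review load but does not replace review.

```python
REVIEW_THRESHOLD = 0.90  # hypothetical cut-off; tune it on your own labelled data


def route(texts, predictions, threshold=REVIEW_THRESHOLD):
    """Split inputs into auto-accepted and human-review queues by score."""
    accepted, needs_review = [], []
    for text, pred in zip(texts, predictions):
        queue = accepted if pred["score"] >= threshold else needs_review
        queue.append((text, pred["label"], pred["score"]))
    return accepted, needs_review


# Illustrative pipeline-shaped predictions, NOT real output from this model.
texts = ["A clear case.", "An ambiguous case."]
predictions = [
    {"label": "LABEL_1", "score": 0.97},
    {"label": "LABEL_0", "score": 0.62},
]
accepted, needs_review = route(texts, predictions)
print(len(accepted), "accepted;", len(needs_review), "sent to review")
```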

Training Details

Data

Trained on stanfordnlp/sst2 for 3 epochs, with no preprocessing beyond standard tokenization (maximum sequence length 128).

Hyperparameters

| Hyperparameter | Value |
|---|---|
| Learning rate | 2e-05 |
| Batch size | 32 |
| Epochs | 3 |
| Optimizer | AdamW |
| LR scheduler | Linear with warmup |
| Max sequence length | 128 |
| fp16 | true |
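To make the "Linear with warmup" schedule concrete, here is a rough sketch of how the learning rate would evolve over the run. The card does not state the warmup length or the training-set size, so both are assumptions: `TRAIN_EXAMPLES` is the commonly reported size of the SST-2 train split, and `WARMUP_RATIO` is a hypothetical 10%. The sketch also assumes a single GPU with no gradient accumulation, so one batch equals one optimizer step.

```python
import math

TRAIN_EXAMPLES = 67_349  # assumption: SST-2 train split size, not stated in this card
BATCH_SIZE = 32          # from the table above
EPOCHS = 3               # from the table above
BASE_LR = 2e-5           # from the table above
WARMUP_RATIO = 0.1       # hypothetical: the card does not give the warmup length

steps_per_epoch = math.ceil(TRAIN_EXAMPLES / BATCH_SIZE)
total_steps = steps_per_epoch * EPOCHS
warmup_steps = int(total_steps * WARMUP_RATIO)


def lr_at(step):
    """Learning rate at an optimizer step: linear ramp up, then linear decay to zero."""
    if step < warmup_steps:
        return BASE_LR * step / max(1, warmup_steps)
    remaining = max(0, total_steps - step)
    return BASE_LR * remaining / max(1, total_steps - warmup_steps)


print("total optimizer steps:", total_steps)
for step in (0, warmup_steps, total_steps // 2, total_steps):
    print(step, lr_at(step))
```

Under these assumptions the rate peaks at 2e-05 when warmup ends and reaches zero at the final step.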

Results

| Metric | Value |
|---|---|
| Eval loss | 0.5475 |
| Eval accuracy | 0.9014 |
| Duration | 13.6 min |
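As a reading aid for the accuracy figure: accuracy is the share of evaluation examples whose predicted label matches the reference label. The sketch below shows the computation and converts the reported value into a count of correct predictions. The card does not say which split was evaluated; the 872 used here is an assumption (the size of the SST-2 validation split), so treat the resulting count as illustrative.

```python
def accuracy(predictions, references):
    """Fraction of predictions that equal their reference label."""
    if len(predictions) != len(references):
        raise ValueError("predictions and references must have the same length")
    correct = sum(p == r for p, r in zip(predictions, references))
    return correct / len(references)


# Toy check with made-up labels: 3 of 4 match.
print(accuracy([1, 0, 1, 1], [1, 0, 0, 1]))

# Assumption: evaluation used the 872-example SST-2 validation split.
VALIDATION_EXAMPLES = 872
REPORTED_ACCURACY = 0.9014  # from the table above
approx_correct = round(REPORTED_ACCURACY * VALIDATION_EXAMPLES)
print(approx_correct, "of", VALIDATION_EXAMPLES, "correct under that assumption")
```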

Experiment Tracking

W&B Run

Infrastructure

| Component | Tool |
|---|---|
| Compute | Kaggle (Tesla T4) |
| Experiment tracking | Weights & Biases |
| Artifact storage | Hugging Face Hub |
| Orchestration | The Founder |