martian786/agnews-salient-salient-tfidf-ner-k16-seed-1

This repository contains one trained run from the AG News salience experiment.

Run details

  • Run name: salient_tfidf_ner_k16
  • Variant: salient_tfidf_ner
  • Base model: roberta-base
  • Seed: 1
  • Token budget for compressed variants: 16
  • Maximum RoBERTa sequence length: 128
  • Training examples: 28500
  • Validation examples: 6000
  • Test examples: 7600

Labels

  • 0: World
  • 1: Sports
  • 2: Business
  • 3: Sci/Tech

Results

Metric Value
Validation accuracy 0.9087
Validation macro F1 0.9085
Test accuracy 0.9088
Test macro F1 0.9087

Uploaded files

This repository includes:

  • model weights, config, and tokenizer at the repository root
  • test_data.csv β€” transformed test data used for this run
  • val_data.csv β€” transformed validation data used for this run
  • train_data_sample.csv β€” sample of transformed training data
  • full_test_predictions.csv β€” full test predictions
  • metrics.json β€” run metrics
  • classification_report.json β€” per-class classification report
  • confusion_matrix.csv β€” confusion matrix
  • trainer_log_history.csv β€” Trainer log history, if available
  • PNG plots for learning curves and final test metrics

Intended use

This model is intended for experiment tracking and reproducibility of AG News classification runs.

It is not intended as a production classifier without further validation.

Reproducibility

The original experiment used:

MODEL_NAME = "roberta-base"
SEED = 1
TOKEN_BUDGET = 16
MAX_SEQ_LEN = 128
TRAIN_SAMPLES = 28500
EPOCHS = 3
BATCH_SIZE = 16
LR = 2e-05
WEIGHT_DECAY = 0.01

Reloading

from transformers import AutoTokenizer, AutoModelForSequenceClassification

repo_id = "martian786/agnews-salient-salient-tfidf-ner-k16-seed-1"
tokenizer = AutoTokenizer.from_pretrained(repo_id)
model = AutoModelForSequenceClassification.from_pretrained(repo_id)
Downloads last month
2
Safetensors
Model size
0.1B params
Tensor type
F32
Β·
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Model tree for martian786/agnews-salient-salient-tfidf-ner-k16-seed-1

Finetuned
(2330)
this model

Dataset used to train martian786/agnews-salient-salient-tfidf-ner-k16-seed-1