---
license: mit
base_model: jhu-clsp/ettin-encoder-32m
datasets:
- stanfordnlp/imdb
language:
- en
pipeline_tag: text-classification
tags:
- ettin
- modernbert
- text-classification
---

# ettin-encoder-32m-imdb-sentiment

`jhu-clsp/ettin-encoder-32m` (ModernBERT encoder, 8192 context) finetuned for **IMDB sentiment (binary)** on `stanfordnlp/imdb`.

## Results (held-out test)

| metric | value |
|---|---|
| accuracy | 0.9263 |
| macro-F1 | 0.9263 |
| eval max_length | 512 |

Finetuned on a single RTX 3080 (bf16). See the project for the full training pipeline.

## Usage

```python
from transformers import AutoTokenizer, AutoModelForSequenceClassification
import torch

tok = AutoTokenizer.from_pretrained("vumichien/ettin-encoder-32m-imdb-sentiment")
model = AutoModelForSequenceClassification.from_pretrained("vumichien/ettin-encoder-32m-imdb-sentiment")

inputs = tok("your text here", truncation=True, max_length=512, return_tensors="pt")
pred = model(**inputs).logits.argmax(-1).item()
print(model.config.id2label[pred])
```