tyqiangz
/

indobert-lite-large-p2-smsa

Text Classification

Model card Files Files and versions

tyqiangz commited on Oct 6, 2021

Commit

9b6b0dd

·

1 Parent(s): 40e087c

Fixed formatting error in README.md

Files changed (1) hide show

README.md +19 -19

README.md CHANGED Viewed

@@ -15,27 +15,8 @@ datasets:
 Finetuned the IndoBERT-Lite Large Model (phase2 - uncased) model on the IndoNLU SmSA dataset following the procedues stated in the paper [IndoNLU: Benchmark and Resources for Evaluating Indonesian
 Natural Language Understanding](https://arxiv.org/pdf/2009.05387.pdf).
-**Finetuning hyperparameters:**
-- learning rate: 2e-5
-- batch size: 16
-- no. of epochs: 5
-- max sequence length: 512
-- random seed: 42
-**Classes:**
-- 0: positive
-- 1: neutral
-- 2: negative
-Validation accuracy: 0.94
-Validation F1: 0.91
-Validation Recall: 0.91
-Validation Precision: 0.93
 ## How to use
-### Load model and tokenizer
 ```python
 from transformers import pipeline
 classifier = pipeline("text-classification",
@@ -51,3 +32,22 @@ Output:
   {'label': 'negative', 'score': 0.987165629863739}]]
 """
 ```

 Finetuned the IndoBERT-Lite Large Model (phase2 - uncased) model on the IndoNLU SmSA dataset following the procedues stated in the paper [IndoNLU: Benchmark and Resources for Evaluating Indonesian
 Natural Language Understanding](https://arxiv.org/pdf/2009.05387.pdf).
 ## How to use
 ```python
 from transformers import pipeline
 classifier = pipeline("text-classification",
   {'label': 'negative', 'score': 0.987165629863739}]]
 """
 ```
+**Finetuning hyperparameters:**
+- learning rate: 2e-5
+- batch size: 16
+- no. of epochs: 5
+- max sequence length: 512
+- random seed: 42
+**Classes:**
+- 0: positive
+- 1: neutral
+- 2: negative
+**Performance metrics on SmSA validation dataset**
+- Validation accuracy: 0.94
+- Validation F1: 0.91
+- Validation Recall: 0.91
+- Validation Precision: 0.93