Hierarchical BERT
Collection
Set of BERT models with Hierarchical attention pre-trained on conversational data to process multiple utterances at once • 8 items • Updated
How to use igorktech/hibial-bert-i3-mlm2 with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("fill-mask", model="igorktech/hibial-bert-i3-mlm2", trust_remote_code=True) # Load model directly
from transformers import AutoModelForMaskedLM
model = AutoModelForMaskedLM.from_pretrained("igorktech/hibial-bert-i3-mlm2", trust_remote_code=True, dtype="auto")YAML Metadata Error:"base_model" with value "/gpfs/home/ikuzmin/hier-bert-pytorch/data/hibial-model" is not valid. Use a model id from https://hf.co/models.
This model is a fine-tuned version of /gpfs/home/ikuzmin/hier-bert-pytorch/data/hibial-model on an unknown dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss | Accuracy |
|---|---|---|---|---|
| 2.7518 | 1.55 | 25000 | 2.5873 | 0.5213 |
| 2.2587 | 3.1 | 50000 | 2.1487 | 0.5824 |