---
base_model: sentence-transformers/paraphrase-mpnet-base-v2
library_name: sentence-transformers
pipeline_tag: sentence-similarity
tags:
- sentence-transformers
- sentence-similarity
- feature-extraction
- generated_from_trainer
- dataset_size:2940
- loss:MultipleNegativesSymmetricRankingLoss
widget:
- source_sentence: 'Question:
Assign probabilities to single events using fractions. There are 4 red balls
and 3 yellow balls in a ∪. The image shows the ∪ with the 4 red balls and 3
yellow balls. What is the probability of picking a red ball?.
Answer: (4 / 3)'
sentences:
- Does not recognise difference of two squares
- Gives the probability out of the number of non-successes rather than the total
- Shares equally instead of in given ratio
- source_sentence: 'Question:
Convert from hours to minutes. 3 hours is the same as ___________ minutes.
Answer: 30'
sentences:
- Starts at the wrong end of the dividend when dividing
- Mixes up squaring and multiplying by 2 or doubling
- Thinks there are 10 minutes in an hour
- source_sentence: 'Question:
Divide integers up to 2 digits using long division. When using long-division to
solve this problem, what digit should replace the triangle to give the correct
answer (quotient)? Division sum written out in Bus Stop format with dividend =
92, divisor = 4, the digits of the quotient are represented by a yellow star and
a purple triangle..
Answer: 0'
sentences:
- Believes multiplying a positive by a negative gives a positive answer
- Confuses linear and reciprocal graphs
- Starts at the wrong end of the dividend when dividing
- source_sentence: 'Question:
Know that the factorised form of a quadratic is useful for finding where the curve
crosses the x-axis. y=(x-3)(x-5) Where does this curve intercept the x axis?.
Answer: (3,5) and (-3,-5)'
sentences:
- Calculates area when asked for perimeter
- Believes both the x and y co-ordinates of the x-intercept of a quadratic are derived
from the constants in the factorised form.
- Does not convert measurements to have the same units before calculating area or
volume
- source_sentence: 'Question:
Understand key loci terms like equidistant and perpendicular. A set of axes: x-axis
from -4 to 4, y-axis from -4 to 4. A red line is drawn from (-2,2) to (2,2). The
red line is _________ to the y axis..
Answer: Equidistant'
sentences:
- Does not know the definition of a rational number
- Believes squaring a negative number just changes the sign
- Does not know the meaning of perpendicular
---
# SentenceTransformer based on sentence-transformers/paraphrase-mpnet-base-v2
This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [sentence-transformers/paraphrase-mpnet-base-v2](https://huggingface.co/sentence-transformers/paraphrase-mpnet-base-v2) on the csv dataset. It maps sentences & paragraphs to a 768-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
## Model Details
### Model Description
- **Model Type:** Sentence Transformer
- **Base model:** [sentence-transformers/paraphrase-mpnet-base-v2](https://huggingface.co/sentence-transformers/paraphrase-mpnet-base-v2)
- **Maximum Sequence Length:** 512 tokens
- **Output Dimensionality:** 768 tokens
- **Similarity Function:** Cosine Similarity
- **Training Dataset:**
- csv
### Model Sources
- **Documentation:** [Sentence Transformers Documentation](https://sbert.net)
- **Repository:** [Sentence Transformers on GitHub](https://github.com/UKPLab/sentence-transformers)
- **Hugging Face:** [Sentence Transformers on Hugging Face](https://huggingface.co/models?library=sentence-transformers)
### Full Model Architecture
```
SentenceTransformer(
(0): Transformer({'max_seq_length': 512, 'do_lower_case': False}) with Transformer model: MPNetModel
(1): Pooling({'word_embedding_dimension': 768, 'pooling_mode_cls_token': False, 'pooling_mode_mean_tokens': True, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
)
```
## Usage
### Direct Usage (Sentence Transformers)
First install the Sentence Transformers library:
```bash
pip install -U sentence-transformers
```
Then you can load this model and run inference.
```python
from sentence_transformers import SentenceTransformer
# Download from the 🤗 Hub
model = SentenceTransformer("Gurveer05/paraphrase-mpnet-eedi-2024")
# Run inference
sentences = [
'Question:\nUnderstand key loci terms like equidistant and perpendicular. A set of axes: x-axis from -4 to 4, y-axis from -4 to 4. A red line is drawn from (-2,2) to (2,2). The red line is _________ to the y axis..\nAnswer: Equidistant',
'Does not know the meaning of perpendicular',
'Believes squaring a negative number just changes the sign',
]
embeddings = model.encode(sentences)
print(embeddings.shape)
# [3, 768]
# Get the similarity scores for the embeddings
similarities = model.similarity(embeddings, embeddings)
print(similarities.shape)
# [3, 3]
```
## Training Details
### Training Dataset
#### csv
* Dataset: csv
* Size: 2,940 training samples
* Columns: sentence1 and sentence2
* Approximate statistics based on the first 1000 samples:
| | sentence1 | sentence2 |
|:--------|:------------------------------------------------------------------------------------|:----------------------------------------------------------------------------------|
| type | string | string |
| details |
Question:
Find missing angles in a scalene triangle. What is the size of angle p ? A triangle with angles labelled 49 degrees, 51 degrees and p [not to scale].
Answer: Not enough information | Does not know that angles in a triangle sum to 180 degrees |
| Question:
Solve quadratic equations using balancing. A student wishes to solve the equation below.
Which of the following is a correct next step?
(
(d+3)^2-25=0
)
Step 1
(
(d+3)^2=25
).
Answer: d+3=12.5 | Believes the inverse of square rooting is halving |
| Question:
Find missing angles using angles around a point. What is the size of angle x ? Angles around a point split into two parts, one is labelled 290 degrees and the other x.
Answer: 45° | Does not know that angles around a point sum to 360 |
* Loss: [MultipleNegativesSymmetricRankingLoss](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#multiplenegativessymmetricrankingloss) with these parameters:
```json
{
"scale": 20.0,
"similarity_fct": "cos_sim"
}
```
### Training Hyperparameters
#### Non-Default Hyperparameters
- `eval_strategy`: steps
- `per_device_train_batch_size`: 32
- `per_device_eval_batch_size`: 32
- `num_train_epochs`: 20
- `fp16`: True
- `load_best_model_at_end`: True
- `batch_sampler`: no_duplicates
#### All Hyperparameters