Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks
Paper • 1908.10084 • Published • 14
How to use shahafvl/reasonir-8b-scientific-parent-prompt with sentence-transformers:
from sentence_transformers import SentenceTransformer
model = SentenceTransformer("shahafvl/reasonir-8b-scientific-parent-prompt", trust_remote_code=True)
sentences = [
"The 3-T system offers an improved SNR and thus can improve the precision of T 1 mapping over the 1.5-T system. However, the TESO-IRFSE technique proposed here applies RF pulses frequently and thus scan time efficiency may be limited in some cases by the higher RF power deposition at 3 T. Our TESO-IRFSE technique has successfully increased the scan time efficiency compared to the conventional IRSE and IRFSE T 1 mapping techniques, and it produces precise and highly reproducible full-brain T 1 maps.",
"In quantitative T1 imaging, there is a trade-off between maximizing precision through higher field strengths and advanced pulse designs and maintaining acceptable scan efficiency under RF power deposition limits. Methods that optimize inversion-recovery sampling schemes and refocusing strategies can substantially reduce acquisition time compared with traditional protocols, while preserving or even enhancing the reliability of full-brain T1 estimates.",
"In integrative regulatory network models, adding biologically motivated sequence features via hierarchical or logit priors increases inferential accuracy but also makes posterior optimization challenging. This has prompted the development of streamlined model formulations and the adoption of Markov chain Monte Carlo techniques as substitutes or complements to variational EM approaches in order to reduce computation time while maintaining modeling flexibility.",
"Models with strong representational power are not inherently constrained to recover the true data-generating process; instead, they can conform closely to any observed sample, including its random variation. This creates a tension between flexibility and generalization, emphasizing the need for appropriate regularization, validation, and model selection procedures."
]
embeddings = model.encode(sentences)
similarities = model.similarity(embeddings, embeddings)
print(similarities.shape)
# [4, 4]This is a sentence-transformers model finetuned from reasonir/ReasonIR-8B. It maps sentences & paragraphs to a 4096-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
SentenceTransformer(
(0): Transformer({'max_seq_length': 131072, 'do_lower_case': False, 'architecture': 'ReasonIRModel'})
(1): Pooling({'word_embedding_dimension': 4096, 'pooling_mode_cls_token': False, 'pooling_mode_mean_tokens': True, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': False})
(2): Normalize()
)
First install the Sentence Transformers library:
pip install -U sentence-transformers
Then you can load this model and run inference.
from sentence_transformers import SentenceTransformer
# Download from the 🤗 Hub
model = SentenceTransformer("shahafvl/reasonir-8b-scientific-parent-prompt")
# Run inference
sentences = [
'Running document analytics pipelines can be highly time-consuming, particularly as the underlying corpora expand at a rapid pace. In addition, these workloads typically demand substantial storage capacity and main memory. A widely adopted strategy to alleviate the storage pressure is to apply data compression.',
'Data-intensive analytics workloads are often both computationally expensive and demanding in terms of storage and memory resources, with costs escalating as datasets grow. One prevalent method for alleviating storage and memory pressure is to store data in compressed form.',
'Endmember variability in spectral unmixing encompasses both extrinsic factors, such as sensor geometry and illumination, and intrinsic factors related to the physical and chemical properties of the materials. While the former can often be approximated with analytical radiative transfer models, the latter is generally too complex to describe explicitly, motivating the adoption of statistical distributions, mixture models, or stochastic processes to model endmember spectra in high-dimensional spaces.',
]
embeddings = model.encode(sentences)
print(embeddings.shape)
# [3, 4096]
# Get the similarity scores for the embeddings
similarities = model.similarity(embeddings, embeddings)
print(similarities)
# tensor([[1.0000, 0.8138, 0.0302],
# [0.8138, 1.0000, 0.0370],
# [0.0302, 0.0370, 1.0000]])
ir_parent_grandparent_combinedInformationRetrievalEvaluator| Metric | Value |
|---|---|
| cosine_accuracy@1 | 1.0 |
| cosine_accuracy@3 | 1.0 |
| cosine_accuracy@5 | 1.0 |
| cosine_accuracy@10 | 1.0 |
| cosine_precision@1 | 1.0 |
| cosine_precision@3 | 1.0 |
| cosine_precision@5 | 0.987 |
| cosine_precision@10 | 0.5949 |
| cosine_recall@1 | 0.1667 |
| cosine_recall@3 | 0.5001 |
| cosine_recall@5 | 0.8225 |
| cosine_recall@10 | 0.9916 |
| cosine_ndcg@10 | 0.9924 |
| cosine_mrr@10 | 1.0 |
| cosine_map@100 | 0.9886 |
orig_vs_parent, orig_vs_grandparent, sib_vs_parent and sib_vs_grandparentTripletEvaluator| Metric | orig_vs_parent | orig_vs_grandparent | sib_vs_parent | sib_vs_grandparent |
|---|---|---|---|---|
| cosine_accuracy | 0.99 | 0.01 | 0.9896 | 0.0104 |
anchor and positive| anchor | positive | |
|---|---|---|
| type | string | string |
| details |
|
|
| anchor | positive |
|---|---|
LLZO is stable against Li metal and against a high voltage cathode. However, oxides generally require high sintering temperatures to remove grain boundaries to achieve the reported conductivity values. They also tend to be brittle, which makes it harder (relative to the sulfides) to maintain solid-solid interfacial contact and also to process. |
Oxide-based solid electrolytes often show excellent electrochemical stability with both lithium metal anodes and high-voltage cathodes, but exploiting their full ionic conductivity generally requires aggressive high-temperature sintering to suppress grain boundary resistance. The resulting brittle ceramic bodies can be challenging to process and to integrate mechanically, particularly when maintaining intimate solid–solid interfacial contact with electrodes is critical. |
LLZO is stable against Li metal and against a high voltage cathode. However, oxides generally require high sintering temperatures to remove grain boundaries to achieve the reported conductivity values. They also tend to be brittle, which makes it harder (relative to the sulfides) to maintain solid-solid interfacial contact and also to process. |
Oxide-based solid electrolytes often show excellent electrochemical stability with both lithium metal anodes and high-voltage cathodes, but exploiting their full ionic conductivity generally requires aggressive high-temperature sintering to suppress grain boundary resistance. The resulting brittle ceramic bodies can be challenging to process and to integrate mechanically, particularly when maintaining intimate solid–solid interfacial contact with electrodes is critical. |
LLZO is stable against Li metal and against a high voltage cathode. However, oxides generally require high sintering temperatures to remove grain boundaries to achieve the reported conductivity values. They also tend to be brittle, which makes it harder (relative to the sulfides) to maintain solid-solid interfacial contact and also to process. |
Oxide-based solid electrolytes often show excellent electrochemical stability with both lithium metal anodes and high-voltage cathodes, but exploiting their full ionic conductivity generally requires aggressive high-temperature sintering to suppress grain boundary resistance. The resulting brittle ceramic bodies can be challenging to process and to integrate mechanically, particularly when maintaining intimate solid–solid interfacial contact with electrodes is critical. |
MultipleNegativesRankingLoss with these parameters:{
"scale": 20.0,
"similarity_fct": "cos_sim",
"gather_across_devices": false
}
eval_strategy: stepslearning_rate: 1e-05num_train_epochs: 1warmup_ratio: 0.05bf16: Truedisable_tqdm: Trueload_best_model_at_end: Truepush_to_hub: Truehub_model_id: shahafvl/reasonir-8b-scientific-parent-prompthub_private_repo: Falseauto_find_batch_size: Trueprompts: {'anchor': 'Retrieve the broader scientific generalization or context for the given specific text.\nQuery: '}overwrite_output_dir: Falsedo_predict: Falseeval_strategy: stepsprediction_loss_only: Trueper_device_train_batch_size: 8per_device_eval_batch_size: 8per_gpu_train_batch_size: Noneper_gpu_eval_batch_size: Nonegradient_accumulation_steps: 1eval_accumulation_steps: Nonetorch_empty_cache_steps: Nonelearning_rate: 1e-05weight_decay: 0.0adam_beta1: 0.9adam_beta2: 0.999adam_epsilon: 1e-08max_grad_norm: 1.0num_train_epochs: 1max_steps: -1lr_scheduler_type: linearlr_scheduler_kwargs: {}warmup_ratio: 0.05warmup_steps: 0log_level: passivelog_level_replica: warninglog_on_each_node: Truelogging_nan_inf_filter: Truesave_safetensors: Truesave_on_each_node: Falsesave_only_model: Falserestore_callback_states_from_checkpoint: Falseno_cuda: Falseuse_cpu: Falseuse_mps_device: Falseseed: 42data_seed: Nonejit_mode_eval: Falsebf16: Truefp16: Falsefp16_opt_level: O1half_precision_backend: autobf16_full_eval: Falsefp16_full_eval: Falsetf32: Nonelocal_rank: 0ddp_backend: Nonetpu_num_cores: Nonetpu_metrics_debug: Falsedebug: []dataloader_drop_last: Falsedataloader_num_workers: 0dataloader_prefetch_factor: Nonepast_index: -1disable_tqdm: Trueremove_unused_columns: Truelabel_names: Noneload_best_model_at_end: Trueignore_data_skip: Falsefsdp: []fsdp_min_num_params: 0fsdp_config: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}fsdp_transformer_layer_cls_to_wrap: Noneaccelerator_config: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}parallelism_config: Nonedeepspeed: Nonelabel_smoothing_factor: 0.0optim: adamw_torch_fusedoptim_args: Noneadafactor: Falsegroup_by_length: Falselength_column_name: lengthproject: huggingfacetrackio_space_id: trackioddp_find_unused_parameters: Noneddp_bucket_cap_mb: Noneddp_broadcast_buffers: Falsedataloader_pin_memory: Truedataloader_persistent_workers: Falseskip_memory_metrics: Trueuse_legacy_prediction_loop: Falsepush_to_hub: Trueresume_from_checkpoint: Nonehub_model_id: shahafvl/reasonir-8b-scientific-parent-prompthub_strategy: every_savehub_private_repo: Falsehub_always_push: Falsehub_revision: Nonegradient_checkpointing: Falsegradient_checkpointing_kwargs: Noneinclude_inputs_for_metrics: Falseinclude_for_metrics: []eval_do_concat_batches: Truefp16_backend: autopush_to_hub_model_id: Nonepush_to_hub_organization: Nonemp_parameters: auto_find_batch_size: Truefull_determinism: Falsetorchdynamo: Noneray_scope: lastddp_timeout: 1800torch_compile: Falsetorch_compile_backend: Nonetorch_compile_mode: Noneinclude_tokens_per_second: Falseinclude_num_input_tokens_seen: noneftune_noise_alpha: Noneoptim_target_modules: Nonebatch_eval_metrics: Falseeval_on_start: Falseuse_liger_kernel: Falseliger_kernel_config: Noneeval_use_gather_object: Falseaverage_tokens_across_devices: Trueprompts: {'anchor': 'Retrieve the broader scientific generalization or context for the given specific text.\nQuery: '}batch_sampler: batch_samplermulti_dataset_batch_sampler: proportionalrouter_mapping: {}learning_rate_mapping: {}| Epoch | Step | Training Loss | ir_parent_grandparent_combined_cosine_ndcg@10 | orig_vs_parent_cosine_accuracy | orig_vs_grandparent_cosine_accuracy | sib_vs_parent_cosine_accuracy | sib_vs_grandparent_cosine_accuracy |
|---|---|---|---|---|---|---|---|
| 0.0049 | 100 | 0.0293 | - | - | - | - | - |
| 0.0099 | 200 | 0.003 | - | - | - | - | - |
| 0.0148 | 300 | 0.0036 | - | - | - | - | - |
| 0.0198 | 400 | 0.0006 | - | - | - | - | - |
| 0.0247 | 500 | 0.0004 | - | - | - | - | - |
| 0.0296 | 600 | 0.0004 | - | - | - | - | - |
| 0.0346 | 700 | 0.0022 | - | - | - | - | - |
| 0.0395 | 800 | 0.0002 | - | - | - | - | - |
| 0.0444 | 900 | 0.0001 | - | - | - | - | - |
| 0.0494 | 1000 | 0.0003 | - | - | - | - | - |
| 0.0543 | 1100 | 0.0028 | - | - | - | - | - |
| 0.0593 | 1200 | 0.002 | - | - | - | - | - |
| 0.0642 | 1300 | 0.0027 | - | - | - | - | - |
| 0.0691 | 1400 | 0.0003 | - | - | - | - | - |
| 0.0741 | 1500 | 0.0021 | - | - | - | - | - |
| 0.0790 | 1600 | 0.0001 | - | - | - | - | - |
| 0.0840 | 1700 | 0.0001 | - | - | - | - | - |
| 0.0889 | 1800 | 0.0018 | - | - | - | - | - |
| 0.0938 | 1900 | 0.0001 | - | - | - | - | - |
| 0.0988 | 2000 | 0.0057 | - | - | - | - | - |
| 0.1037 | 2100 | 0.0028 | - | - | - | - | - |
| 0.1086 | 2200 | 0.0027 | - | - | - | - | - |
| 0.1136 | 2300 | 0.0 | - | - | - | - | - |
| 0.1185 | 2400 | 0.0 | - | - | - | - | - |
| 0.1235 | 2500 | 0.0001 | - | - | - | - | - |
| 0.1284 | 2600 | 0.0045 | - | - | - | - | - |
| 0.1333 | 2700 | 0.0018 | - | - | - | - | - |
| 0.1383 | 2800 | 0.0037 | - | - | - | - | - |
| 0.1432 | 2900 | 0.0088 | - | - | - | - | - |
| 0.1482 | 3000 | 0.0069 | - | - | - | - | - |
| 0.1531 | 3100 | 0.0027 | - | - | - | - | - |
| 0.1580 | 3200 | 0.0001 | - | - | - | - | - |
| 0.1630 | 3300 | 0.0002 | - | - | - | - | - |
| 0.1679 | 3400 | 0.002 | - | - | - | - | - |
| 0.1728 | 3500 | 0.0001 | - | - | - | - | - |
| 0.1778 | 3600 | 0.0023 | - | - | - | - | - |
| 0.1827 | 3700 | 0.0001 | - | - | - | - | - |
| 0.1877 | 3800 | 0.0001 | - | - | - | - | - |
| 0.1926 | 3900 | 0.0001 | - | - | - | - | - |
| 0.1975 | 4000 | 0.0027 | - | - | - | - | - |
| 0.2025 | 4100 | 0.0 | - | - | - | - | - |
| 0.2074 | 4200 | 0.0027 | - | - | - | - | - |
| 0.2124 | 4300 | 0.0027 | - | - | - | - | - |
| 0.2173 | 4400 | 0.0027 | - | - | - | - | - |
| 0.2222 | 4500 | 0.0 | - | - | - | - | - |
| 0.2272 | 4600 | 0.0001 | - | - | - | - | - |
| 0.2321 | 4700 | 0.0 | - | - | - | - | - |
| 0.2370 | 4800 | 0.002 | - | - | - | - | - |
| 0.2420 | 4900 | 0.0069 | - | - | - | - | - |
| 0.2469 | 5000 | 0.0 | - | - | - | - | - |
| 0.2519 | 5100 | 0.0016 | - | - | - | - | - |
| 0.2568 | 5200 | 0.0001 | - | - | - | - | - |
| 0.2617 | 5300 | 0.0 | - | - | - | - | - |
| 0.2667 | 5400 | 0.0022 | - | - | - | - | - |
| 0.2716 | 5500 | 0.0018 | - | - | - | - | - |
| 0.2766 | 5600 | 0.0019 | - | - | - | - | - |
| 0.2815 | 5700 | 0.0 | - | - | - | - | - |
| 0.2864 | 5800 | 0.0 | - | - | - | - | - |
| 0.2914 | 5900 | 0.0018 | - | - | - | - | - |
| 0.2963 | 6000 | 0.0 | - | - | - | - | - |
| 0.3012 | 6100 | 0.0001 | - | - | - | - | - |
| 0.3062 | 6200 | 0.0 | - | - | - | - | - |
| 0.3111 | 6300 | 0.002 | - | - | - | - | - |
| 0.3161 | 6400 | 0.0021 | - | - | - | - | - |
| 0.3210 | 6500 | 0.0 | - | - | - | - | - |
| 0.3259 | 6600 | 0.0032 | - | - | - | - | - |
| 0.3309 | 6700 | 0.002 | - | - | - | - | - |
| 0.3358 | 6800 | 0.0018 | - | - | - | - | - |
| 0.3408 | 6900 | 0.0001 | - | - | - | - | - |
| 0.3457 | 7000 | 0.0018 | - | - | - | - | - |
| 0.3506 | 7100 | 0.0 | - | - | - | - | - |
| 0.3556 | 7200 | 0.0 | - | - | - | - | - |
| 0.3605 | 7300 | 0.0018 | - | - | - | - | - |
| 0.3655 | 7400 | 0.0 | - | - | - | - | - |
| 0.3704 | 7500 | 0.0052 | 0.9905 | 0.9900 | 0.0100 | 0.9892 | 0.0108 |
| 0.3753 | 7600 | 0.0036 | - | - | - | - | - |
| 0.3803 | 7700 | 0.0022 | - | - | - | - | - |
| 0.3852 | 7800 | 0.0 | - | - | - | - | - |
| 0.3901 | 7900 | 0.0 | - | - | - | - | - |
| 0.3951 | 8000 | 0.0025 | - | - | - | - | - |
| 0.4000 | 8100 | 0.0021 | - | - | - | - | - |
| 0.4050 | 8200 | 0.0 | - | - | - | - | - |
| 0.4099 | 8300 | 0.0 | - | - | - | - | - |
| 0.4148 | 8400 | 0.0 | - | - | - | - | - |
| 0.4198 | 8500 | 0.0001 | - | - | - | - | - |
| 0.4247 | 8600 | 0.0 | - | - | - | - | - |
| 0.4297 | 8700 | 0.0018 | - | - | - | - | - |
| 0.4346 | 8800 | 0.0 | - | - | - | - | - |
| 0.4395 | 8900 | 0.0001 | - | - | - | - | - |
| 0.4445 | 9000 | 0.0 | - | - | - | - | - |
| 0.4494 | 9100 | 0.0039 | - | - | - | - | - |
| 0.4543 | 9200 | 0.0042 | - | - | - | - | - |
| 0.4593 | 9300 | 0.0019 | - | - | - | - | - |
| 0.4642 | 9400 | 0.0023 | - | - | - | - | - |
| 0.4692 | 9500 | 0.0 | - | - | - | - | - |
| 0.4741 | 9600 | 0.0 | - | - | - | - | - |
| 0.4790 | 9700 | 0.0019 | - | - | - | - | - |
| 0.4840 | 9800 | 0.0 | - | - | - | - | - |
| 0.4889 | 9900 | 0.0019 | - | - | - | - | - |
| 0.4939 | 10000 | 0.0 | - | - | - | - | - |
| 0.4988 | 10100 | 0.0048 | - | - | - | - | - |
| 0.5037 | 10200 | 0.0 | - | - | - | - | - |
| 0.5087 | 10300 | 0.0018 | - | - | - | - | - |
| 0.5136 | 10400 | 0.0013 | - | - | - | - | - |
| 0.5185 | 10500 | 0.0043 | - | - | - | - | - |
| 0.5235 | 10600 | 0.0001 | - | - | - | - | - |
| 0.5284 | 10700 | 0.0 | - | - | - | - | - |
| 0.5334 | 10800 | 0.0 | - | - | - | - | - |
| 0.5383 | 10900 | 0.0 | - | - | - | - | - |
| 0.5432 | 11000 | 0.0027 | - | - | - | - | - |
| 0.5482 | 11100 | 0.0062 | - | - | - | - | - |
| 0.5531 | 11200 | 0.0001 | - | - | - | - | - |
| 0.5581 | 11300 | 0.0001 | - | - | - | - | - |
| 0.5630 | 11400 | 0.0001 | - | - | - | - | - |
| 0.5679 | 11500 | 0.0048 | - | - | - | - | - |
| 0.5729 | 11600 | 0.0 | - | - | - | - | - |
| 0.5778 | 11700 | 0.0012 | - | - | - | - | - |
| 0.5827 | 11800 | 0.0026 | - | - | - | - | - |
| 0.5877 | 11900 | 0.0037 | - | - | - | - | - |
| 0.5926 | 12000 | 0.0 | - | - | - | - | - |
| 0.5976 | 12100 | 0.0 | - | - | - | - | - |
| 0.6025 | 12200 | 0.0059 | - | - | - | - | - |
| 0.6074 | 12300 | 0.0 | - | - | - | - | - |
| 0.6124 | 12400 | 0.0039 | - | - | - | - | - |
| 0.6173 | 12500 | 0.003 | - | - | - | - | - |
| 0.6223 | 12600 | 0.0 | - | - | - | - | - |
| 0.6272 | 12700 | 0.0 | - | - | - | - | - |
| 0.6321 | 12800 | 0.0 | - | - | - | - | - |
| 0.6371 | 12900 | 0.0036 | - | - | - | - | - |
| 0.6420 | 13000 | 0.0071 | - | - | - | - | - |
| 0.6469 | 13100 | 0.0 | - | - | - | - | - |
| 0.6519 | 13200 | 0.0 | - | - | - | - | - |
| 0.6568 | 13300 | 0.0 | - | - | - | - | - |
| 0.6618 | 13400 | 0.0 | - | - | - | - | - |
| 0.6667 | 13500 | 0.0033 | - | - | - | - | - |
| 0.6716 | 13600 | 0.0 | - | - | - | - | - |
| 0.6766 | 13700 | 0.0 | - | - | - | - | - |
| 0.6815 | 13800 | 0.0001 | - | - | - | - | - |
| 0.6865 | 13900 | 0.0018 | - | - | - | - | - |
| 0.6914 | 14000 | 0.0001 | - | - | - | - | - |
| 0.6963 | 14100 | 0.0036 | - | - | - | - | - |
| 0.7013 | 14200 | 0.0015 | - | - | - | - | - |
| 0.7062 | 14300 | 0.0001 | - | - | - | - | - |
| 0.7111 | 14400 | 0.0074 | - | - | - | - | - |
| 0.7161 | 14500 | 0.0 | - | - | - | - | - |
| 0.7210 | 14600 | 0.0035 | - | - | - | - | - |
| 0.7260 | 14700 | 0.0016 | - | - | - | - | - |
| 0.7309 | 14800 | 0.0018 | - | - | - | - | - |
| 0.7358 | 14900 | 0.0022 | - | - | - | - | - |
| 0.7408 | 15000 | 0.0 | 0.9924 | 0.99 | 0.01 | 0.9896 | 0.0104 |
| 0.7457 | 15100 | 0.0023 | - | - | - | - | - |
| 0.7507 | 15200 | 0.0018 | - | - | - | - | - |
| 0.7556 | 15300 | 0.0 | - | - | - | - | - |
| 0.7605 | 15400 | 0.0 | - | - | - | - | - |
| 0.7655 | 15500 | 0.0036 | - | - | - | - | - |
| 0.7704 | 15600 | 0.0 | - | - | - | - | - |
| 0.7753 | 15700 | 0.0 | - | - | - | - | - |
| 0.7803 | 15800 | 0.0 | - | - | - | - | - |
| 0.7852 | 15900 | 0.0036 | - | - | - | - | - |
| 0.7902 | 16000 | 0.0018 | - | - | - | - | - |
| 0.7951 | 16100 | 0.0019 | - | - | - | - | - |
| 0.8000 | 16200 | 0.003 | - | - | - | - | - |
| 0.8050 | 16300 | 0.0034 | - | - | - | - | - |
| 0.8099 | 16400 | 0.0 | - | - | - | - | - |
| 0.8149 | 16500 | 0.0 | - | - | - | - | - |
| 0.8198 | 16600 | 0.006 | - | - | - | - | - |
| 0.8247 | 16700 | 0.0018 | - | - | - | - | - |
| 0.8297 | 16800 | 0.0029 | - | - | - | - | - |
| 0.8346 | 16900 | 0.0 | - | - | - | - | - |
| 0.8395 | 17000 | 0.0018 | - | - | - | - | - |
| 0.8445 | 17100 | 0.0014 | - | - | - | - | - |
| 0.8494 | 17200 | 0.0032 | - | - | - | - | - |
| 0.8544 | 17300 | 0.0 | - | - | - | - | - |
| 0.8593 | 17400 | 0.0063 | - | - | - | - | - |
| 0.8642 | 17500 | 0.0 | - | - | - | - | - |
| 0.8692 | 17600 | 0.0002 | - | - | - | - | - |
| 0.8741 | 17700 | 0.0026 | - | - | - | - | - |
| 0.8791 | 17800 | 0.0069 | - | - | - | - | - |
| 0.8840 | 17900 | 0.0026 | - | - | - | - | - |
| 0.8889 | 18000 | 0.0061 | - | - | - | - | - |
| 0.8939 | 18100 | 0.0 | - | - | - | - | - |
| 0.8988 | 18200 | 0.0026 | - | - | - | - | - |
| 0.9037 | 18300 | 0.0 | - | - | - | - | - |
| 0.9087 | 18400 | 0.0 | - | - | - | - | - |
| 0.9136 | 18500 | 0.0 | - | - | - | - | - |
| 0.9186 | 18600 | 0.0022 | - | - | - | - | - |
| 0.9235 | 18700 | 0.0 | - | - | - | - | - |
| 0.9284 | 18800 | 0.0 | - | - | - | - | - |
| 0.9334 | 18900 | 0.0016 | - | - | - | - | - |
| 0.9383 | 19000 | 0.0076 | - | - | - | - | - |
| 0.9433 | 19100 | 0.0 | - | - | - | - | - |
| 0.9482 | 19200 | 0.0026 | - | - | - | - | - |
| 0.9531 | 19300 | 0.0 | - | - | - | - | - |
| 0.9581 | 19400 | 0.0 | - | - | - | - | - |
| 0.9630 | 19500 | 0.0018 | - | - | - | - | - |
| 0.9679 | 19600 | 0.002 | - | - | - | - | - |
| 0.9729 | 19700 | 0.0042 | - | - | - | - | - |
| 0.9778 | 19800 | 0.0044 | - | - | - | - | - |
| 0.9828 | 19900 | 0.0 | - | - | - | - | - |
| 0.9877 | 20000 | 0.0 | - | - | - | - | - |
| 0.9926 | 20100 | 0.0 | - | - | - | - | - |
| 0.9976 | 20200 | 0.0018 | - | - | - | - | - |
@inproceedings{reimers-2019-sentence-bert,
title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
author = "Reimers, Nils and Gurevych, Iryna",
booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
month = "11",
year = "2019",
publisher = "Association for Computational Linguistics",
url = "https://arxiv.org/abs/1908.10084",
}
@misc{henderson2017efficient,
title={Efficient Natural Language Response Suggestion for Smart Reply},
author={Matthew Henderson and Rami Al-Rfou and Brian Strope and Yun-hsuan Sung and Laszlo Lukacs and Ruiqi Guo and Sanjiv Kumar and Balint Miklos and Ray Kurzweil},
year={2017},
eprint={1705.00652},
archivePrefix={arXiv},
primaryClass={cs.CL}
}