File size: 72,583 Bytes
1f22f3b
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
695
696
697
698
699
700
701
702
703
704
705
706
707
708
709
710
711
712
713
714
715
716
717
718
719
720
721
722
723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747
748
749
750
751
752
753
754
755
756
757
758
759
760
761
762
763
764
765
766
767
768
769
770
771
772
773
774
775
776
777
778
779
780
781
782
783
784
785
786
787
788
789
790
791
792
793
794
795
796
797
798
799
800
801
802
803
804
805
806
807
808
809
810
811
812
813
814
815
816
817
818
819
820
821
822
823
824
825
826
827
828
829
830
831
832
833
834
835
836
837
838
839
840
841
842
843
844
845
846
847
848
849
850
851
852
853
854
855
856
857
858
859
860
861
862
863
864
865
866
867
868
869
870
871
872
873
874
875
876
877
878
879
880
881
882
883
884
885
886
887
888
889
890
891
892
893
894
895
896
897
898
899
900
901
902
903
904
905
906
907
908
909
910
911
912
913
914
915
916
917
918
919
920
921
922
923
924
925
926
927
928
929
930
931
932
933
934
935
936
937
938
939
940
941
942
943
944
945
946
947
948
949
950
951
952
953
954
---
tags:
- sentence-transformers
- sparse-encoder
- sparse
- generated_from_trainer
- dataset_size:404856
- loss:SpladeMixedTopKLoss
- loss:FlopsLoss
widget:
- text: 'I use these at work for when things break - sort of like a poor-man''s build
    monitor.  It catches people''s attention and they know "something" is happening.
    I have both the Red and Yellow colors so I can distinguish between "something''s
    wrong" and "OMG - OUTAGE!!!!" (I deal with a major website). Some people complain
    about how bright it is - I don''t really care. During the first day that I fired
    up the Yellow (something''s wrong), I had about 10 people stop by and they were
    like "hey.... so - what''s that cause it''s f''n annoying....." They quickly found
    out that it''s a "go away, yes, something is wrong" light. I''ve had developers
    complain that they''re gonna have seizures (hasn''t happened yet) and people have
    figure out that, if the light is on - stay away unless you''re involved.  Other
    folks have actually changed how I present this light, due to how bright it is.
    Some people don''t like the reflection (my closest cube mates), while others think
    it needs to be seen by EVERYONE. We came up with our own compromise - custom molded
    tin-foil.  ROAD USE: I''m also a a road flare (the real kind) user. These, in
    my opinion, would not be the greatest thing to use. I''m going to buy my wife
    one for her motorcycle, but that''s the only reason why (elements due to her leather
    strapped bags). A car needs something much brighter. While these have multiple
    patterns, it''s not great for anything in the city. If you''re in the middle of
    nowhere, sure.  Edit: I''ve updated this a little. I finally ran one of mine out
    of batteries. It basically did all kinds of funky things during it''s last few
    minutes and I pulled the plug. I have the kind with a Lithium CR123A 3V battery.
    It lasted a LONG time compared to what I originally expected. I use these lights
    for issues that last 45 minutes to an hour. For two full days, this thing ran
    like a champ without issues (note: 8 hrs per day). Overall, I''d estimate the
    run time to be around 25-30 hours of solid use. The battery replacement for this
    model is a pain, but that''s good. There are 2 screws to remove, then you have
    to pry the top off. The painful part is the top - it has an o-ring (GOOD!) that
    protects the electronics. After about 3-4 minutes, I was able to get it open and
    pull the battery.  After seeing the run-time and construction on the inside, I''m
    changing this to 5-stars - but again, for the Lithium version. I hadn''t ordered
    my wife''s yet - I''m ordering it today.  If you''re a developer or admin looking
    for a poor-man''s" trouble in little china" light. Eventually, I''ll probably
    move it over to an Adrino or Pi, but for now, pushing a button and slapping it
    in a "custom aluminum foil reflector" is good enough for now.'
- text: This was a gift from my daughter...  I probably wouldn't have spent as much
    on a travel mug, but, after using this mug for about a month, I find it worth
    the extra money, as opposed to buying a cheaper mug.  First off, the mug itself
    is made of stainless steel, and will not break like a plastic mug would.  This
    includes the interior and exterior of the mug.  I have had mugs that were stainless
    steel on the outside, but plastic on the inside, and those mugs eventually separated,
    and generally did not last long.  Of course, the lid is made of plastic and rubber
    materials, but is made to be durable, and appears to be very well constructed.  This
    mug seals tight.  Generally a travel mug must be kept upright at all times, as
    the closed position is not much more than a piece of plastic that slides over
    a hole.  In this mug, there is a plastic part that has a rubber seal on it, and
    when in the closed position, seals the mug tight, equivalent to the seal for a
    thermos bottle, and prevents any leakage.  I actually put this in my briefcase
    when walking into my office with no worries of leaks or spills in the briefcase.  My
    other travel mugs never allowed this.  I have also had hot coffee in this mug
    for 5+ hours, and found the coffee to be hot after that amount of time.  Although
    the instructions allows for top rack dishwasher cleaning, I believe it would be
    best to clean this by hand, especially with regards to the parts in the lid.  The
    lid comes apart into three pieces, and also has seals that are on those pieces.  I
    would not recommend any of those parts in the dishwasher, and cleaning the mug
    is not a difficult chore.  Overall, I'm giving this mug 5 stars, as the only issue
    I find not in its favor is its price, but compared to a $10 mug, made of plastic,
    this mug is far superior, and worth the extra money.  I have not had this mug
    for several years, but expect it to last at least a few years, and possibly longer
    with little or no issues.  My recommendation for others looking for a travel mug
    would be to buy one of these, based on its ability to keep liquids hot, being
    able to seal tightly and not leak your drink if tipped, and the excellent construction
    of this mug.
- text: First, the reader should be advised that I'm an "old school" guy that has
    spent a lifetime grousing about the cost of shoes ... ALL kinds of shoes ... tennis
    shoes ... dress shoes ... hiking shoes ... flip-flops ... you name it.  I'm the
    guy that regularly prowls the shoe section at re-sellers like Marshalls and Ross,
    always searching for that once-in-a-lifetime bargain/value that miraculously shows
    up in my size ... "you just gotta get there on the right day ..." Am I right,
    men?  While all that prowling has undoubtedly saved me a small fortune over the
    years, it also taught me something.  Sometimes, you just can't get what you really
    want unless you're willing to spend what it costs.  That's the way I feel about
    these shoes.  I knew I wanted a captoe Oxford in this color, and I spent a LONG
    time looking at LOTS of different makers/designers on Amazon.  I never found anything
    that looked as nice as these Cole Haans.  For the reasons explained above, I never
    previously bought this label, so I wasn't sure about the sizing.  My last pair
    of high-end dress shoes were Bostonians (and yes ... I groused bitterly about
    the cost when I broke down and bought them), which "ran short."  I got those shoes
    in 13D, which fit and served me well through some pretty heavy use over a number
    of years.  So, I figured I'd try the same in the Cole Haans.  My mistake ... in
    13D they fit just OK at the widest part of my foot (a little tight, but not uncomfortable),
    but WAY too long in the toe, and just didn't fit my foot right.  Didn't have a
    lot of time to mess around, so I put the first pair aside and re-ordered another
    pair in 12EE (W), figuring that adjustment would make the perfect fit, and then
    I'd return the first pair, after I was sure.  The second pair fit my foot like
    a glove, right out of the box.  Very comfortable, and a perfect fit for my foot.  Unfortunately,
    with the second pair, I also noticed a small irregularity in the upper sole stitching
    on one of the two, right at the ball of the foot, and a blemish in the tanning
    on the tongue of the other.  Knowing I would never be happy if I "settled" for
    that, I put that pair aside too, and ordered a THIRD pair, also in 12EE (W).  The
    third time was indeed the charm ... this pair was perfect in every respect, and
    in my opinion, worth every penny they cost.  The "British tan" finish is really
    nice, and exactly as shown in the photo.  The stitching is flawless.  The leather
    is supple and uniform throughout every surface of the shoe.  The heels and soles
    are very thick and strong ... a bit stiff at first, but I'm a big guy, so that's
    fine with me.  My Bostonians had leather soles, and I realized (after wearing
    completely through one of them) that rubber soles, like these, work better for
    me because I do as much walking on outdoor surfaces like concrete and asphalt
    as I do on tile and other indoor surfaces.  Amazon made my return of the first
    two pairs a breeze, and they started processing a full refund on my credit card
    the same day I dropped the return boxes off with UPS.  Thanks, Amazon ... you
    made this particular online shopping experience a real pleasure for me -- not
    only in helping me find these terrific shoes, but also getting them to me quickly,
    in the perfect condition and size I needed.  Once you figure out what size actually
    fits you, you'll love these Cole Haans ...
- text: 'I had no problems installing then pairing this to my Z-Wave SmartThings hub
    and 2GIG alarm system. But it was replacing an older KW 910 (that lasted well
    over past couple years) so I had prior experience ;-)  A couple of notables:  -
    I realized I needed a 10-key pad instead of the previous 5-key pad for better
    security. Mathematically it is SIXTEEN TIMES harder for an intruder to guess any
    valid 4-digit code than on a 5-key pad! And don''t forget that for every unique
    user code you add (spouse, guest, maid, etc) divides those intruder chances for
    less security (ie, a 5-key pad at best is 1:625 odds, but with 31 user codes now
    only 31:625 or a 5% chance of hitting a valid code on the 1st try or a likely
    hit within 20 tries!) - Regardless the chances, do NOT tie your lock to automatically
    disarm your alarm. It''s just stupid. Sure, the same code was first entered in
    the lock keypad, but make it hard for intruders then facing an alarm countdown
    try to remember that. - Like any Z-wave lock (which all have extra AES signal
    encryption) consider powering up this lock BEFORE installing it in the door so
    you can be close (like <3 ft) to your Z-wave hub/alarm panel to pair/include it.
    Otherwise, you''ll have pairing problems like you see so many people here. - The
    "random digits" feature is nice but confusing to explain to guests like the maid
    to follow two numbers before then entering their code. It is intended so that
    dusting the panel is not obvious where four most common fingerprints are. Maybe
    nice if you share just one common code. Or just remember to tap around the screen
    from time to time. - My only gripe is that the keypad really could be more sensitive
    to waking up. My wife first complained then guests complained that you have to
    REALLY press hard to wake up the panel to then see the numbers. It is especially
    annoying when you have your hands full on the way out and have to wait for the
    lock icon to appear. They probably should have kept the lock action as a physical
    button like my old KW 910. I ended up turning on the 32-sec auto lock DIP switch.'
- text: 'This is a really great book and I highly recommend it, both as a compelling
    autobiography and as a way to learn about Islam and Islamism.  One thing I learned
    from this book is the tremendous variation within Islam. The distinction that
    was most important to me was the difference between Religious (or Spiritual) Islam
    and Political Islam.  The values and social ideals of Religious Islam are virtually
    identical to Christiaity and Buddhism (the two world religion with which I am
    most familiar).  The emphasis on care for the poor, the needy, and less fortunate
    is indistinguishable among these religions.  Further, Islamic mysticism describes
    Allah in the same sacred language used by mystics of all stripes.  This blends
    well with our religious heritage and adds new inspiring stories and language that
    will benefit all with an open mind.  On the other side is Political Islam, it''s
    hatred of the West, it''s lack of sound Islamic scholarship and spirituality,
    it''s bigotry and shameful treatment of women, it''s dedication to an Islamic
    State, and it''s willingness to kill in order to attain it.  This form of Islam,
    and I hesitate even calling it Islam given it''s shallow scholarship and deviance
    from fundamental Islamic values and teachings, breeds extremism and terrorism.  This
    form of Islam poses a concrete threat to America and other Western countries.  Within
    our country, as well as countries throughout the world, this twisted form of Islam
    is taught, primarily, by missionaries from Saudia Arabia.  They have combined
    a rigid, severe, and literalist version of Islam that dates to the mid 1700s with
    the political goal of transforming the world into a single Islamic state.  They
    are True Believers of the worst kind and the foundation on which Islamic terrorists
    stand.  Note:  After reading The Islamist and writing the review above, I read
    Islam by <PERSON>, an excellent overview of Islam.  One point that he makes is
    that Islam is not just a religion, but a way of life, and that Islam permeates
    all aspects of the Muslim,s life, including politics.  Because this applies to
    all forms of Islam, the distinction made in The Islamist, and repeated in my review,
    between "Religious Islam" and "Political Islam" is not correct, or at least not
    precise.  Other labels also have there shortcomings, so I will let the wording
    used by the author stand.  Nonetheless, The Islamist remains and excellent, powerful,
    and intensely personal work.'
pipeline_tag: feature-extraction
library_name: sentence-transformers
metrics:
- dot_accuracy@1
- dot_accuracy@8
- dot_accuracy@50
- dot_accuracy@100
- dot_precision@1
- dot_precision@8
- dot_precision@50
- dot_precision@100
- dot_recall@1
- dot_recall@8
- dot_recall@50
- dot_recall@100
- dot_ndcg@10
- dot_mrr@10
- dot_map@100
- query_active_dims
- query_sparsity_ratio
- corpus_active_dims
- corpus_sparsity_ratio
- avg_flops
model-index:
- name: Sparse Encoder
  results:
  - task:
      type: sparse-information-retrieval
      name: Sparse Information Retrieval
    dataset:
      name: sparse ir eval
      type: sparse-ir-eval
    metrics:
    - type: dot_accuracy@1
      value: 0.05798840231953609
      name: Dot Accuracy@1
    - type: dot_accuracy@8
      value: 0.14057188562287543
      name: Dot Accuracy@8
    - type: dot_accuracy@50
      value: 0.28234353129374123
      name: Dot Accuracy@50
    - type: dot_accuracy@100
      value: 0.367126574685063
      name: Dot Accuracy@100
    - type: dot_precision@1
      value: 0.05798840231953609
      name: Dot Precision@1
    - type: dot_precision@8
      value: 0.017571485702859428
      name: Dot Precision@8
    - type: dot_precision@50
      value: 0.005646870625874825
      name: Dot Precision@50
    - type: dot_precision@100
      value: 0.00367126574685063
      name: Dot Precision@100
    - type: dot_recall@1
      value: 0.05798840231953609
      name: Dot Recall@1
    - type: dot_recall@8
      value: 0.14057188562287543
      name: Dot Recall@8
    - type: dot_recall@50
      value: 0.28234353129374123
      name: Dot Recall@50
    - type: dot_recall@100
      value: 0.367126574685063
      name: Dot Recall@100
    - type: dot_ndcg@10
      value: 0.09982067360776156
      name: Dot Ndcg@10
    - type: dot_mrr@10
      value: 0.08341625325728497
      name: Dot Mrr@10
    - type: dot_map@100
      value: 0.09054371719801727
      name: Dot Map@100
    - type: query_active_dims
      value: 4085.432861328125
      name: Query Active Dims
    - type: query_sparsity_ratio
      value: 0.9731108304725139
      name: Query Sparsity Ratio
    - type: corpus_active_dims
      value: 4083.059326171875
      name: Corpus Active Dims
    - type: corpus_sparsity_ratio
      value: 0.9731264524130432
      name: Corpus Sparsity Ratio
    - type: avg_flops
      value: 1038.0469970703125
      name: Avg Flops
---

# Sparse Encoder

This is a [Sparse Encoder](https://www.sbert.net/docs/sparse_encoder/usage/usage.html) model trained on the json dataset using the [sentence-transformers](https://www.SBERT.net) library. It maps sentences & paragraphs to a 151936-dimensional sparse vector space   and can be used for semantic search and sparse retrieval.
## Model Details

### Model Description
- **Model Type:** Sparse Encoder
<!-- - **Base model:** [Unknown](https://huggingface.co/unknown) -->
- **Maximum Sequence Length:** 512 tokens
- **Output Dimensionality:** 151936 dimensions
- **Similarity Function:** Dot Product
- **Training Dataset:**
    - json
<!-- - **Language:** Unknown -->
<!-- - **License:** Unknown -->

### Model Sources

- **Documentation:** [Sentence Transformers Documentation](https://sbert.net)
- **Documentation:** [Sparse Encoder Documentation](https://www.sbert.net/docs/sparse_encoder/usage/usage.html)
- **Repository:** [Sentence Transformers on GitHub](https://github.com/UKPLab/sentence-transformers)
- **Hugging Face:** [Sparse Encoders on Hugging Face](https://huggingface.co/models?library=sentence-transformers&other=sparse-encoder)

### Full Model Architecture

```
SparseEncoder(
  (0): MLMTransformer({'max_seq_length': 512, 'do_lower_case': False, 'architecture': 'Qwen3ForEmbeddingFusedMeanpool'})
  (1): SpladePooling({'pooling_strategy': 'mean', 'activation_function': 'log1p_relu', 'word_embedding_dimension': 151936})
)
```

## Usage

### Direct Usage (Sentence Transformers)

First install the Sentence Transformers library:

```bash
pip install -U sentence-transformers
```

Then you can load this model and run inference.
```python
from sentence_transformers import SparseEncoder

# Download from the 🤗 Hub
model = SparseEncoder("sparse_encoder_model_id")
# Run inference
queries = [
    "This review is for the newcomer to DSLR\u0027s and not the advanced user. So we don\u0027t need to hear the experts bashing on how someone doesn\u0027t know what they are doing and shoudn\u0027t get a camera like this. I am speaking for the person who needs a camera to capture their kids in the school play, soccer game etc. and can\u0027t capture the moment with a point and shoot. You can\u0027t expect awesome pictures if you do not learn how to use all the settings, features, and purpose of an SLR camera. You might even get worse pictures than your point and shoot. Sure an SLR has an auto feature where it will figure what is the best settings to use for a basic picture, but you also end up with a basic picture.  But even all said I would still recommend an SLR and a good zoom lens that you can afford. It is worth spending the money because you can never recapture that moment of your child or whatever your trying to capture.  How many pictures have you taken with a point and shoot that were horrible because they were too dark or blurry? Sure they are fine if the lighting is good and you are within 10 feet or so away. Any further than that well, why bother. The larger zoom type cameras are better but are still limited when lighting is an issue. The pocket type point and shoots usually have a lag time between shots or flash charging up again. So forget about getting off several shots in succesion. Hard to capture a picture if the object is in motion also. The larger zoom types have mimimal lag time and some can even fire off bursts of 3 to 5 in a second with one push of the button and better at capturing something in motion. But they still can\u0027t beat an DSLR. I have gotten shots of my children with an DSLR with low lighting and no flash from many rows of seats away and they were great. The image quality was even good enough where I can crop it and still look good. People using their camera phones crack me up, but that\u0027s another story.  I am not going to get into all the technicalities of this camera. You can read the other reviews for that. I just want to mention that if you are one of those people who need to step up to an SLR because the point and shoot couldn\u0027t do the job but don\u0027t want to become the serious photographer then this inforomation might be what you are looking for.  I find the XSI pictures are soft and not crisp. Some photographers prefer this. The new Nikon DSLR\u0027s take crisper pictures, and they are less expensive.  The main con of the XSI is the focusing points. If you use the camera in auto mode then it will focus on the nearest or most identifiable focus points it can focus on. So it might focus on an object in front of the actual thing your trying to capture that is behind it, or on a purse or print on a shirt for example instead of the persons face or what not. If your trying to get a group picture with the guest of honor in the middle of it, you might not have that person in full focus but instead the preople around because the cameras focus is focusing on the most vivid or closest points. You can set the camera in one of the manual settings and pick one focus point of your choice, like center, top, left, right etc. but you can\u0027t pick all on. In auto it will focus on more than one but like I mentioned before, not excactly on what you want. you can press the shutter button half way and it will try and refocus on different points or you can get closer to your subject and press the shutter button half way and hold it so it keeps that focus point and then you step back and snap the picture. Well you don\u0027t have the luxury of time on your hands for those kid or spontaneous shots.  So this is where I recommend the newer Nikon DSLR\u0027s. Their focus points I find tend to focus on more points and usually on the center also where my subject tends to be. The XSI was here and there. If the area your focusing on is busy, the XSI will pick a few focus points here and there were it thinks are the best. But with the Nikon, I found it to focus on more points which is nice because the whole image will be more sharper instead of certain areas of it when cropping or zooming the image on a computer. Unless you are going for the \"bokeh\" effect in which the subject is in focus and the area around it is blurred.  A camera of this size isn\u0027t always convienient to carry around to all occasions because of it\u0027s size, accessories, worries, or whatever the reason. If you already have a pocket point and shoot or zoom camera, don\u0027t get rid of them. I keep them for when I go on vacation because I don\u0027t want to carry a big DSLR around on vacation. I use the pocket one for when we go out and I don\u0027t want to carry anything bulky and for those you never know shots. I use the zoom one for the amusement parks. It\u0027s small enough where it is still manageable where I can toss in my backpack. I use that one for the amusement parks because I can get pictures of the kids on rides. The zoom and speed is adequate enough where I can get pictures of the kids in the distance or when their zooming down the roller coaster. One day I might get rid of my pocket one and get a waterproof one, that way I have all my bases covered. Ha ha.  But when you learn how to use a DSLR\u0027s features coupled with a decent lens, man the pictures are like night and day.  If you still find that a DSLR is just too much over kill for you then you might want to check out the Nikon S8100. It is a little higher end point and shoot that is a little larger than your average point and shoot but it takes amazing pictures at a distance and in low lighting and has HD video capture with stereo. My nephew has the older S8000 model of this and I was blown away at the pictures and videos he took at my childs play. And we were sitting way back. They made improvements on the S8100 where it counted so I can\u0027t imganine how good it would be now, wow.  I would recommend you try out different cameras and brands in the store and decide which one suits you. Look for optical zoom, optical image stabilization, and good low light capability.",
]
documents = [
    "Mail Boss makes several different models that look identical so pay close attention of the model names. I looked at the Architectural Oasis Jr., Architectural Oasis trilock and the Mail Boss Mail Manager. A person on youtube  demonstrated how easily he was able to manipulate his arm into the mail chute and fish out a package and them demonstrated easily and quickly he jammed a large screwdriver into the edge of the lower door and popped it open shattering the lock. The trilock is the reinforced version of the Jr. It has an upgraded locking system where the Jr. only had a single larch whereas the trilock has a latch and a metal rod on each side of the door that creates three locking points. The Jr.'s door was a simple flat sheet of metal that didn't go all the way to the surrounding edges so it is vulnerable to prying open. The trilock's door sides go all the way to the edge and is flanged for strength and the top edge is tapered to fit under the top door to prevent a screwdriver or something to get behind it.  I mention the above to give insight as to why I chose the Mail Manager. The Jr. was way too easy to pry open so it dropped out of consideration automatically. The trilock seemed just as tamper proof as the Manager as far as prying open is. The ease of fishing mail out of the Jr. and the trillock with your arm is questionable as I was not able to get my arm in like the person on youtube.  The conditions have to be perfect for someone to be able to fish out the mail. The person has to have skinny arms and the mailbox has to be low enough for the person to contort their arm and then reach down. With the Manager the door is smaller thus creating a smaller opening. The Jr, and trilock's doors are larger but there is about a 2 inch plate that hangs on the inside of the box that prevents one from reaching straight into the box and you have to contort around it.  The Jr. and trilock had rubber gaskets around the doors which is nice if you live in an area with heavy rain and wind. The Manager had some mention that their mail gets wet but that is easily addressable by getting thin adhesive gasket strips. The trilock and the Manager in my opinion were about equally secure in tamper resistance and build quality. The Manager and Jr. were about the same price and the trilock was about $20.00 more. The Manager had a little less capacity. The main thing that led me to get the Manager over the trilock was the ability to accept larger packages. It looks like the trilock should be able to accept larger packages because it has a bigger door and the Manager's door is smaller but that is not the case because of the door opening and obstruction I mentioned earlier. All three boxes were only able to accept a package that was about 8 1/2 x 4 3/4 x 3. With the Jr. and trilock the package had to be placed a certain way because of the obstruction on the inside. If you take off the mail clip on the inside of the doors then you can increase the package size to about 10x4 3/4x3. If your getting a locking mailbox then you are worried about mail theft so why would you place outgoing mail clipped to the unlocked upper door anyways. Back to the package capacity. The Jr. and trillock were maxed out in package size unless it is a flat envelope package that you can shove past the overhang with the door closed part way giving you the angle. The manager on the other hand does not have this overhang other than a slight lip so you basically have a clear pass through where you can roughly get a 10 x 15 x 1 1/4 package in. The other notable pluses for the Manager is that it comes with 4 bolts and a specialized drill bit for installation whereas the other two you had to supply your own hardware. And lastly the Jr. and trilock came with two large keys where the Manager came with three small keys. There are bigger ones in the Mail Boss line but they run around a $100.00 more. One negative about the Mail Manager is that it is shallower and slightly narrower than the Jr. And trillock so when then mailman delivers large junk mail like advertisements they can get hung up on the inside lower lip of the door because they are too wide for the box and the box isn't deep enough for gravity to have it fall in completely because the portion of mail on the inside is touching the bottom of the box along with the other end propped up on the door lip and the sides of the junk mail pressing on the sides. If your mailman is good, then all he has to do is give a light nudge and the mail will fall in. The Jr. and trilock offer a slightly wider opening and has a deeper collection area so gravity can do it's work.",
    'This is a really great book and I highly recommend it, both as a compelling autobiography and as a way to learn about Islam and Islamism.  One thing I learned from this book is the tremendous variation within Islam. The distinction that was most important to me was the difference between Religious (or Spiritual) Islam and Political Islam.  The values and social ideals of Religious Islam are virtually identical to Christiaity and Buddhism (the two world religion with which I am most familiar).  The emphasis on care for the poor, the needy, and less fortunate is indistinguishable among these religions.  Further, Islamic mysticism describes Allah in the same sacred language used by mystics of all stripes.  This blends well with our religious heritage and adds new inspiring stories and language that will benefit all with an open mind.  On the other side is Political Islam, it\'s hatred of the West, it\'s lack of sound Islamic scholarship and spirituality, it\'s bigotry and shameful treatment of women, it\'s dedication to an Islamic State, and it\'s willingness to kill in order to attain it.  This form of Islam, and I hesitate even calling it Islam given it\'s shallow scholarship and deviance from fundamental Islamic values and teachings, breeds extremism and terrorism.  This form of Islam poses a concrete threat to America and other Western countries.  Within our country, as well as countries throughout the world, this twisted form of Islam is taught, primarily, by missionaries from Saudia Arabia.  They have combined a rigid, severe, and literalist version of Islam that dates to the mid 1700s with the political goal of transforming the world into a single Islamic state.  They are True Believers of the worst kind and the foundation on which Islamic terrorists stand.  Note:  After reading The Islamist and writing the review above, I read Islam by <PERSON>, an excellent overview of Islam.  One point that he makes is that Islam is not just a religion, but a way of life, and that Islam permeates all aspects of the Muslim,s life, including politics.  Because this applies to all forms of Islam, the distinction made in The Islamist, and repeated in my review, between "Religious Islam" and "Political Islam" is not correct, or at least not precise.  Other labels also have there shortcomings, so I will let the wording used by the author stand.  Nonetheless, The Islamist remains and excellent, powerful, and intensely personal work.',
    "I recently bought a new embroidery machine as an upgrade my old entry level machine.  I thought I'd upgrade my thread as well.  The reviews for this thread seemed very positive.  I recently learned that embroidery thread has a shelf life and it was appealing that the thread was protected so well by the plastic dome and the plug.  I purchased several colors and intended to use this thread exclusively in my new machine.  Unfortunately during my first project with the new machine, this thread was breaking at a discouraging rate.  I thought something was wrong with the machine.  I actually brought it back to the dealer a couple of times but when they tested it, they had no thread breakage.  Of course, they were not using Hemingworth thread.  Nobody thought it was the thread so I would bring the machine home and experience breakage again.  In desperation, I brought the machine back to the dealer and took along my thread, hoop, stabilizer, and one of the designs that had given me grief with a lot of thread breakage.  I spent two hours with the dealer's technician.  The technician examined the machine and could find nothing wrong with it.  We then sewed my design with my Hemingworth red thread.  The thread broke again and again.  We ended up doing a lot of testing with various weight bobbin thread, tension settings, threads, needles, etc..  The conclusion was that it was the Hemingworth thread causing the issues.  The technician who patiently sat with me thought the thread felt heavy and sticky.  He said it was probably old thread from the way it behaved.  What I couldn't argue with was that every time we rethreaded with another brand of thread and sewed the same design under all the same conditions, there was no upper thread breakage.  I'm not happy that I purchased several Hemingworth spools that are now of no use to me but I'm thankful that I didn't purchase any of the large sets I was considering.",
]
query_embeddings = model.encode_query(queries)
document_embeddings = model.encode_document(documents)
print(query_embeddings.shape, document_embeddings.shape)
# [1, 151936] [3, 151936]

# Get the similarity scores for the embeddings
similarities = model.similarity(query_embeddings, document_embeddings)
print(similarities)
# tensor([[23.6560, 13.6671, 16.5997]])
```

<!--
### Direct Usage (Transformers)

<details><summary>Click to see the direct usage in Transformers</summary>

</details>
-->

<!--
### Downstream Usage (Sentence Transformers)

You can finetune this model on your own dataset.

<details><summary>Click to expand</summary>

</details>
-->

<!--
### Out-of-Scope Use

*List how the model may foreseeably be misused and address what users ought not to do with the model.*
-->

## Evaluation

### Metrics

#### Sparse Information Retrieval

* Dataset: `sparse-ir-eval`
* Evaluated with [<code>SparseInformationRetrievalEvaluator</code>](https://sbert.net/docs/package_reference/sparse_encoder/evaluation.html#sentence_transformers.sparse_encoder.evaluation.SparseInformationRetrievalEvaluator) with these parameters:
  ```json
  {
      "max_active_dims": 4096
  }
  ```

| Metric                | Value      |
|:----------------------|:-----------|
| dot_accuracy@1        | 0.058      |
| dot_accuracy@8        | 0.1406     |
| dot_accuracy@50       | 0.2823     |
| dot_accuracy@100      | 0.3671     |
| dot_precision@1       | 0.058      |
| dot_precision@8       | 0.0176     |
| dot_precision@50      | 0.0056     |
| dot_precision@100     | 0.0037     |
| dot_recall@1          | 0.058      |
| dot_recall@8          | 0.1406     |
| dot_recall@50         | 0.2823     |
| dot_recall@100        | 0.3671     |
| **dot_ndcg@10**       | **0.0998** |
| dot_mrr@10            | 0.0834     |
| dot_map@100           | 0.0905     |
| query_active_dims     | 4085.4329  |
| query_sparsity_ratio  | 0.9731     |
| corpus_active_dims    | 4083.0593  |
| corpus_sparsity_ratio | 0.9731     |
| avg_flops             | 1038.047   |

<!--
## Bias, Risks and Limitations

*What are the known or foreseeable issues stemming from this model? You could also flag here known failure cases or weaknesses of the model.*
-->

<!--
### Recommendations

*What are recommendations with respect to the foreseeable issues? For example, filtering explicit content.*
-->

## Training Details

### Training Dataset

#### json

* Dataset: json
* Size: 404,856 training samples
* Columns: <code>query</code> and <code>positive</code>
* Approximate statistics based on the first 1000 samples:
  |         | query                                                                                 | positive                                                                              |
  |:--------|:--------------------------------------------------------------------------------------|:--------------------------------------------------------------------------------------|
  | type    | string                                                                                | string                                                                                |
  | details | <ul><li>min: 394 tokens</li><li>mean: 495.05 tokens</li><li>max: 512 tokens</li></ul> | <ul><li>min: 394 tokens</li><li>mean: 495.05 tokens</li><li>max: 512 tokens</li></ul> |
* Samples:
  | query                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    | positive                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                 |
  |:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
  | <code>This knife, and theJ.A. Henckels Twin Four Star 3-Inch High Carbon Stainless-Steel Paring Knifeare the two best knives in the Henckels four star collection.  There is something "just so" about them.  They are just right, with all the various design parameters coming together to create a whole that is greater than the sum of the parts.  This serrated utility knife works well in a great variety of applications.  The five inch serrated blade is nicely thin (but still thick enough for good strength and rigidity) and shallow (i.e. not broad).  I find it very useful for cutting pie or cake or brownies, as well as (of course) bread and tomatoes and many other vegetables.  I've had this knife now for SIXTEEN YEARS, and it is still going strong, and still one of my favorites.  However . . . you must SHARPEN this knife eventually.  Like any other knife, it will go dull.  NEVER HONE THIS KNIFE OR ANY OTHER SERRATED KNIFE!  A sharpening steel is too large a diameter to be used on a serrated knife....</code> | <code>When I moved into my first (and current) house from my apartment, the previous owner had a Whirlpool (Ecodyne) WHER25 reverse osmosis system installed under the kitchen sink.  I liked the water the system produced, but the flow control was misfunctioning, causing an annoying dripping sound that was almost constant.  The installer (previous owner, not a plumber) had NOT made the common mistake of trimming out the flow control--which was the first thing I suspected.  No, the problem, rather, was deformation of the thin rubber membranes (there are two) inside the head of the unit.  I flipped them over (they are reversible) and this fixed the problem for a month or so, but it returned.  I priced out new membranes/gaskets and flow control insert, with shipping, and decided that I should just start fresh with a whole new unit, since it was on a special sale locally and it would come with all new filters ($80 worth).  I replaced just the head and all was well for a while.  Then the tank stopp...</code> |
  | <code>When I moved into my first (and current) house from my apartment, the previous owner had a Whirlpool (Ecodyne) WHER25 reverse osmosis system installed under the kitchen sink.  I liked the water the system produced, but the flow control was misfunctioning, causing an annoying dripping sound that was almost constant.  The installer (previous owner, not a plumber) had NOT made the common mistake of trimming out the flow control--which was the first thing I suspected.  No, the problem, rather, was deformation of the thin rubber membranes (there are two) inside the head of the unit.  I flipped them over (they are reversible) and this fixed the problem for a month or so, but it returned.  I priced out new membranes/gaskets and flow control insert, with shipping, and decided that I should just start fresh with a whole new unit, since it was on a special sale locally and it would come with all new filters ($80 worth).  I replaced just the head and all was well for a while.  Then the tank stopp...</code> | <code>This knife, and theJ.A. Henckels Twin Four Star 3-Inch High Carbon Stainless-Steel Paring Knifeare the two best knives in the Henckels four star collection.  There is something "just so" about them.  They are just right, with all the various design parameters coming together to create a whole that is greater than the sum of the parts.  This serrated utility knife works well in a great variety of applications.  The five inch serrated blade is nicely thin (but still thick enough for good strength and rigidity) and shallow (i.e. not broad).  I find it very useful for cutting pie or cake or brownies, as well as (of course) bread and tomatoes and many other vegetables.  I've had this knife now for SIXTEEN YEARS, and it is still going strong, and still one of my favorites.  However . . . you must SHARPEN this knife eventually.  Like any other knife, it will go dull.  NEVER HONE THIS KNIFE OR ANY OTHER SERRATED KNIFE!  A sharpening steel is too large a diameter to be used on a serrated knife....</code> |
  | <code>The Good: Sawstop customer service is the best I have dealt with in years. When set up correctly it cuts sheet good like a dream. Only a panel saw would seem better. The adjustable stops are stout and easy to use. Great for repeat cuts. Sliding mechanism is very smooth  The Bad: No postive stops - in my experience this borders on being a huge problem for two reasons. First, it is not easy to get the fence square to the blade if you want to be very accurate. On the best of days it takes me 5 minutes to get it close enough to make a 48" cut square. Without positive stops I have to square the sliding table fence every time it is bumped or removed. And, I remove it regularly as the sliding table fence sits close enough to the blade that almost all cuts over 48" using the regular saw fence demand the sliding table fence be removed or swung out of the way (if the cut is less than 48" the sliding table can be moved back with fence in place forming a little pocket to work within). The fence th...</code> | <code>The Bad (yes there are a lot of bad things even with 5 stars): One of the worst written non fiction books I have ever owned. I really don't care if one of the authors clients liked a sauce so well that she would eat it over kitty litter. I don't care to read 100s of testaments to how good the recipes are (they are pretty good). I just want to get on with the book. Prove the recipes are not good. Don't spend all those pages trying to convince me. It even backfired. I was sure they were going to be terrible are reading all the testaments.  Get ready to cook. A lot. Get ready to do a lot of dishes.  Have to plan ahead. Have to make lunches the night before often. The flax seed breakfast takes some work and time. Can't just whip it up. If you run out without having already prepared more you will find yourself without a breakfast.  Terribly organized. It is not sequential. You will read something and then find out later you were not suppose to do it when you did unless you read the entire b...</code> |
* Loss: <code>model.SpladeMixedTopKLoss.SpladeMixedTopKLoss</code> with these parameters:
  ```json
  {
      "loss": "SparseMultipleNegativesRankingLoss(scale=1.0, similarity_fct='dot_score', gather_across_devices=False)",
      "document_regularizer_weight": 0.005,
      "query_regularizer_weight": 0.005,
      "document_regularizer_threshold": 256,
      "query_regularizer_threshold": 256
  }
  ```

### Training Hyperparameters
#### Non-Default Hyperparameters

- `eval_strategy`: steps
- `per_device_train_batch_size`: 64
- `weight_decay`: 0.0001
- `num_train_epochs`: 1
- `lr_scheduler_type`: cosine
- `warmup_ratio`: 0.15
- `save_only_model`: True
- `bf16`: True
- `dataloader_num_workers`: 8
- `gradient_checkpointing`: True

#### All Hyperparameters
<details><summary>Click to expand</summary>

- `overwrite_output_dir`: False
- `do_predict`: False
- `eval_strategy`: steps
- `prediction_loss_only`: True
- `per_device_train_batch_size`: 64
- `per_device_eval_batch_size`: 8
- `per_gpu_train_batch_size`: None
- `per_gpu_eval_batch_size`: None
- `gradient_accumulation_steps`: 1
- `eval_accumulation_steps`: None
- `torch_empty_cache_steps`: None
- `learning_rate`: 5e-05
- `weight_decay`: 0.0001
- `adam_beta1`: 0.9
- `adam_beta2`: 0.999
- `adam_epsilon`: 1e-08
- `max_grad_norm`: 1
- `num_train_epochs`: 1
- `max_steps`: -1
- `lr_scheduler_type`: cosine
- `lr_scheduler_kwargs`: {}
- `warmup_ratio`: 0.15
- `warmup_steps`: 0
- `log_level`: passive
- `log_level_replica`: warning
- `log_on_each_node`: True
- `logging_nan_inf_filter`: True
- `save_safetensors`: True
- `save_on_each_node`: False
- `save_only_model`: True
- `restore_callback_states_from_checkpoint`: False
- `no_cuda`: False
- `use_cpu`: False
- `use_mps_device`: False
- `seed`: 42
- `data_seed`: None
- `jit_mode_eval`: False
- `bf16`: True
- `fp16`: False
- `fp16_opt_level`: O1
- `half_precision_backend`: auto
- `bf16_full_eval`: False
- `fp16_full_eval`: False
- `tf32`: None
- `local_rank`: 0
- `ddp_backend`: None
- `tpu_num_cores`: None
- `tpu_metrics_debug`: False
- `debug`: []
- `dataloader_drop_last`: False
- `dataloader_num_workers`: 8
- `dataloader_prefetch_factor`: None
- `past_index`: -1
- `disable_tqdm`: False
- `remove_unused_columns`: True
- `label_names`: None
- `load_best_model_at_end`: False
- `ignore_data_skip`: False
- `fsdp`: []
- `fsdp_min_num_params`: 0
- `fsdp_config`: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}
- `fsdp_transformer_layer_cls_to_wrap`: None
- `accelerator_config`: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}
- `parallelism_config`: None
- `deepspeed`: None
- `label_smoothing_factor`: 0.0
- `optim`: adamw_torch_fused
- `optim_args`: None
- `adafactor`: False
- `group_by_length`: False
- `length_column_name`: length
- `project`: huggingface
- `trackio_space_id`: trackio
- `ddp_find_unused_parameters`: None
- `ddp_bucket_cap_mb`: None
- `ddp_broadcast_buffers`: False
- `dataloader_pin_memory`: True
- `dataloader_persistent_workers`: False
- `skip_memory_metrics`: True
- `use_legacy_prediction_loop`: False
- `push_to_hub`: False
- `resume_from_checkpoint`: None
- `hub_model_id`: None
- `hub_strategy`: every_save
- `hub_private_repo`: None
- `hub_always_push`: False
- `hub_revision`: None
- `gradient_checkpointing`: True
- `gradient_checkpointing_kwargs`: None
- `include_inputs_for_metrics`: False
- `include_for_metrics`: []
- `eval_do_concat_batches`: True
- `fp16_backend`: auto
- `push_to_hub_model_id`: None
- `push_to_hub_organization`: None
- `mp_parameters`: 
- `auto_find_batch_size`: False
- `full_determinism`: False
- `torchdynamo`: None
- `ray_scope`: last
- `ddp_timeout`: 1800
- `torch_compile`: False
- `torch_compile_backend`: None
- `torch_compile_mode`: None
- `include_tokens_per_second`: False
- `include_num_input_tokens_seen`: no
- `neftune_noise_alpha`: None
- `optim_target_modules`: None
- `batch_eval_metrics`: False
- `eval_on_start`: False
- `use_liger_kernel`: False
- `liger_kernel_config`: None
- `eval_use_gather_object`: False
- `average_tokens_across_devices`: True
- `prompts`: None
- `batch_sampler`: batch_sampler
- `multi_dataset_batch_sampler`: proportional
- `router_mapping`: {}
- `learning_rate_mapping`: {}

</details>

### Training Logs
<details><summary>Click to expand</summary>

| Epoch  | Step | Training Loss | sparse-ir-eval_dot_ndcg@10 |
|:------:|:----:|:-------------:|:--------------------------:|
| 0.0032 | 20   | 3756.6391     | -                          |
| 0.0063 | 40   | 3881.184      | -                          |
| 0.0095 | 60   | 3375.248      | -                          |
| 0.0126 | 80   | 2256.2281     | -                          |
| 0.0158 | 100  | 1877.1715     | -                          |
| 0.0190 | 120  | 996.4254      | -                          |
| 0.0221 | 140  | 436.1054      | -                          |
| 0.0253 | 160  | 115.5228      | -                          |
| 0.0285 | 180  | 24.6653       | -                          |
| 0.0316 | 200  | 4.2           | -                          |
| 0.0348 | 220  | 2.9732        | -                          |
| 0.0379 | 240  | 2.5846        | -                          |
| 0.0411 | 260  | 2.3334        | -                          |
| 0.0443 | 280  | 2.3134        | -                          |
| 0.0474 | 300  | 2.1832        | -                          |
| 0.0506 | 320  | 2.2094        | -                          |
| 0.0537 | 340  | 2.1111        | -                          |
| 0.0569 | 360  | 2.0669        | -                          |
| 0.0601 | 380  | 1.9633        | -                          |
| 0.0632 | 400  | 1.8683        | -                          |
| 0.0664 | 420  | 1.8812        | -                          |
| 0.0696 | 440  | 1.8815        | -                          |
| 0.0727 | 460  | 1.801         | -                          |
| 0.0759 | 480  | 1.7649        | -                          |
| 0.0790 | 500  | 1.7384        | -                          |
| 0.0822 | 520  | 1.681         | -                          |
| 0.0854 | 540  | 1.618         | -                          |
| 0.0885 | 560  | 1.6502        | -                          |
| 0.0917 | 580  | 1.5538        | -                          |
| 0.0948 | 600  | 1.59          | -                          |
| 0.0980 | 620  | 1.6011        | -                          |
| 0.1012 | 640  | 1.619         | -                          |
| 0.1043 | 660  | 1.5551        | -                          |
| 0.1075 | 680  | 1.543         | -                          |
| 0.1107 | 700  | 1.5369        | -                          |
| 0.1138 | 720  | 1.4984        | -                          |
| 0.1170 | 740  | 1.5329        | -                          |
| 0.1201 | 760  | 1.4913        | -                          |
| 0.1233 | 780  | 1.4744        | -                          |
| 0.1265 | 800  | 1.4889        | -                          |
| 0.1296 | 820  | 1.4597        | -                          |
| 0.1328 | 840  | 1.4809        | -                          |
| 0.1359 | 860  | 1.5088        | -                          |
| 0.1391 | 880  | 1.524         | -                          |
| 0.1423 | 900  | 1.4924        | -                          |
| 0.1454 | 920  | 1.402         | -                          |
| 0.1486 | 940  | 1.3694        | -                          |
| 0.1518 | 960  | 1.3744        | -                          |
| 0.1549 | 980  | 1.463         | -                          |
| 0.1581 | 1000 | 1.4266        | -                          |
| 0.1612 | 1020 | 1.4474        | -                          |
| 0.1644 | 1040 | 1.4167        | -                          |
| 0.1676 | 1060 | 1.4221        | -                          |
| 0.1707 | 1080 | 1.373         | -                          |
| 0.1739 | 1100 | 1.373         | -                          |
| 0.1770 | 1120 | 1.4101        | -                          |
| 0.1802 | 1140 | 1.2815        | -                          |
| 0.1834 | 1160 | 1.2742        | -                          |
| 0.1865 | 1180 | 1.3225        | -                          |
| 0.1897 | 1200 | 1.3122        | -                          |
| 0.1929 | 1220 | 1.2834        | -                          |
| 0.1960 | 1240 | 1.371         | -                          |
| 0.1992 | 1260 | 1.2451        | -                          |
| 0.2023 | 1280 | 1.3268        | -                          |
| 0.2055 | 1300 | 1.2299        | -                          |
| 0.2087 | 1320 | 1.3697        | -                          |
| 0.2118 | 1340 | 1.3479        | -                          |
| 0.2150 | 1360 | 1.2536        | -                          |
| 0.2181 | 1380 | 1.2595        | -                          |
| 0.2213 | 1400 | 1.2496        | -                          |
| 0.2245 | 1420 | 1.2979        | -                          |
| 0.2276 | 1440 | 1.2197        | -                          |
| 0.2308 | 1460 | 1.2328        | -                          |
| 0.2340 | 1480 | 1.2163        | -                          |
| 0.2371 | 1500 | 1.1419        | -                          |
| 0.2403 | 1520 | 1.2274        | -                          |
| 0.2434 | 1540 | 1.2009        | -                          |
| 0.2466 | 1560 | 1.2072        | -                          |
| 0.2498 | 1580 | 1.1109        | -                          |
| 0.2529 | 1600 | 1.1361        | -                          |
| 0.2561 | 1620 | 1.1738        | -                          |
| 0.2592 | 1640 | 1.2371        | -                          |
| 0.2624 | 1660 | 1.2902        | -                          |
| 0.2656 | 1680 | 1.2095        | -                          |
| 0.2687 | 1700 | 1.1699        | -                          |
| 0.2719 | 1720 | 1.2076        | -                          |
| 0.2751 | 1740 | 1.0683        | -                          |
| 0.2782 | 1760 | 1.1479        | -                          |
| 0.2814 | 1780 | 1.0934        | -                          |
| 0.2845 | 1800 | 1.2092        | -                          |
| 0.2877 | 1820 | 1.1907        | -                          |
| 0.2909 | 1840 | 1.1763        | -                          |
| 0.2940 | 1860 | 1.1401        | -                          |
| 0.2972 | 1880 | 1.1643        | -                          |
| 0.3003 | 1900 | 1.1495        | -                          |
| 0.3035 | 1920 | 1.1115        | -                          |
| 0.3067 | 1940 | 1.1425        | -                          |
| 0.3098 | 1960 | 1.1125        | -                          |
| 0.3130 | 1980 | 1.1415        | -                          |
| 0.3162 | 2000 | 0.9995        | 0.0848                     |
| 0.3193 | 2020 | 1.0974        | -                          |
| 0.3225 | 2040 | 1.1384        | -                          |
| 0.3256 | 2060 | 1.1301        | -                          |
| 0.3288 | 2080 | 1.0622        | -                          |
| 0.3320 | 2100 | 1.106         | -                          |
| 0.3351 | 2120 | 1.1544        | -                          |
| 0.3383 | 2140 | 1.077         | -                          |
| 0.3414 | 2160 | 1.0385        | -                          |
| 0.3446 | 2180 | 1.0091        | -                          |
| 0.3478 | 2200 | 1.0553        | -                          |
| 0.3509 | 2220 | 1.1105        | -                          |
| 0.3541 | 2240 | 1.08          | -                          |
| 0.3573 | 2260 | 1.0693        | -                          |
| 0.3604 | 2280 | 1.027         | -                          |
| 0.3636 | 2300 | 0.9827        | -                          |
| 0.3667 | 2320 | 1.0395        | -                          |
| 0.3699 | 2340 | 1.0946        | -                          |
| 0.3731 | 2360 | 1.0235        | -                          |
| 0.3762 | 2380 | 1.0376        | -                          |
| 0.3794 | 2400 | 1.0803        | -                          |
| 0.3825 | 2420 | 1.0266        | -                          |
| 0.3857 | 2440 | 1.0295        | -                          |
| 0.3889 | 2460 | 1.0816        | -                          |
| 0.3920 | 2480 | 1.0421        | -                          |
| 0.3952 | 2500 | 1.0067        | -                          |
| 0.3984 | 2520 | 0.9935        | -                          |
| 0.4015 | 2540 | 0.9707        | -                          |
| 0.4047 | 2560 | 1.0572        | -                          |
| 0.4078 | 2580 | 0.9424        | -                          |
| 0.4110 | 2600 | 1.038         | -                          |
| 0.4142 | 2620 | 1.056         | -                          |
| 0.4173 | 2640 | 0.9943        | -                          |
| 0.4205 | 2660 | 0.9619        | -                          |
| 0.4236 | 2680 | 1.0193        | -                          |
| 0.4268 | 2700 | 0.9485        | -                          |
| 0.4300 | 2720 | 0.93          | -                          |
| 0.4331 | 2740 | 0.9073        | -                          |
| 0.4363 | 2760 | 1.0188        | -                          |
| 0.4395 | 2780 | 1.0057        | -                          |
| 0.4426 | 2800 | 0.8854        | -                          |
| 0.4458 | 2820 | 0.9935        | -                          |
| 0.4489 | 2840 | 0.9777        | -                          |
| 0.4521 | 2860 | 0.9265        | -                          |
| 0.4553 | 2880 | 0.9509        | -                          |
| 0.4584 | 2900 | 0.9526        | -                          |
| 0.4616 | 2920 | 0.8809        | -                          |
| 0.4647 | 2940 | 0.9624        | -                          |
| 0.4679 | 2960 | 0.9414        | -                          |
| 0.4711 | 2980 | 0.9497        | -                          |
| 0.4742 | 3000 | 0.9158        | -                          |
| 0.4774 | 3020 | 0.8677        | -                          |
| 0.4806 | 3040 | 0.9295        | -                          |
| 0.4837 | 3060 | 0.9222        | -                          |
| 0.4869 | 3080 | 0.9325        | -                          |
| 0.4900 | 3100 | 0.8811        | -                          |
| 0.4932 | 3120 | 0.9919        | -                          |
| 0.4964 | 3140 | 0.9024        | -                          |
| 0.4995 | 3160 | 0.8907        | -                          |
| 0.5027 | 3180 | 0.9365        | -                          |
| 0.5058 | 3200 | 0.9972        | -                          |
| 0.5090 | 3220 | 0.9611        | -                          |
| 0.5122 | 3240 | 0.9248        | -                          |
| 0.5153 | 3260 | 0.9298        | -                          |
| 0.5185 | 3280 | 0.9047        | -                          |
| 0.5217 | 3300 | 0.8981        | -                          |
| 0.5248 | 3320 | 0.9552        | -                          |
| 0.5280 | 3340 | 0.8675        | -                          |
| 0.5311 | 3360 | 0.9345        | -                          |
| 0.5343 | 3380 | 0.8429        | -                          |
| 0.5375 | 3400 | 0.8839        | -                          |
| 0.5406 | 3420 | 0.9124        | -                          |
| 0.5438 | 3440 | 0.8977        | -                          |
| 0.5469 | 3460 | 0.9338        | -                          |
| 0.5501 | 3480 | 0.8865        | -                          |
| 0.5533 | 3500 | 0.9118        | -                          |
| 0.5564 | 3520 | 0.8961        | -                          |
| 0.5596 | 3540 | 0.9371        | -                          |
| 0.5628 | 3560 | 0.9258        | -                          |
| 0.5659 | 3580 | 0.8354        | -                          |
| 0.5691 | 3600 | 0.895         | -                          |
| 0.5722 | 3620 | 0.8392        | -                          |
| 0.5754 | 3640 | 0.8145        | -                          |
| 0.5786 | 3660 | 0.8824        | -                          |
| 0.5817 | 3680 | 0.8645        | -                          |
| 0.5849 | 3700 | 0.8799        | -                          |
| 0.5880 | 3720 | 0.8537        | -                          |
| 0.5912 | 3740 | 0.818         | -                          |
| 0.5944 | 3760 | 0.8558        | -                          |
| 0.5975 | 3780 | 0.9098        | -                          |
| 0.6007 | 3800 | 0.9256        | -                          |
| 0.6039 | 3820 | 0.8419        | -                          |
| 0.6070 | 3840 | 0.8406        | -                          |
| 0.6102 | 3860 | 0.8835        | -                          |
| 0.6133 | 3880 | 0.8146        | -                          |
| 0.6165 | 3900 | 0.7797        | -                          |
| 0.6197 | 3920 | 0.8241        | -                          |
| 0.6228 | 3940 | 0.85          | -                          |
| 0.6260 | 3960 | 0.866         | -                          |
| 0.6291 | 3980 | 0.9223        | -                          |
| 0.6323 | 4000 | 0.8503        | 0.0995                     |
| 0.6355 | 4020 | 0.8245        | -                          |
| 0.6386 | 4040 | 0.8637        | -                          |
| 0.6418 | 4060 | 0.8433        | -                          |
| 0.6450 | 4080 | 0.7897        | -                          |
| 0.6481 | 4100 | 0.8193        | -                          |
| 0.6513 | 4120 | 0.8269        | -                          |
| 0.6544 | 4140 | 0.8663        | -                          |
| 0.6576 | 4160 | 0.895         | -                          |
| 0.6608 | 4180 | 0.8711        | -                          |
| 0.6639 | 4200 | 0.7989        | -                          |
| 0.6671 | 4220 | 0.7911        | -                          |
| 0.6702 | 4240 | 0.8225        | -                          |
| 0.6734 | 4260 | 0.7779        | -                          |
| 0.6766 | 4280 | 0.8445        | -                          |
| 0.6797 | 4300 | 0.8239        | -                          |
| 0.6829 | 4320 | 0.8112        | -                          |
| 0.6861 | 4340 | 0.7937        | -                          |
| 0.6892 | 4360 | 0.8593        | -                          |
| 0.6924 | 4380 | 0.7969        | -                          |
| 0.6955 | 4400 | 0.7834        | -                          |
| 0.6987 | 4420 | 0.782         | -                          |
| 0.7019 | 4440 | 0.8304        | -                          |
| 0.7050 | 4460 | 0.7929        | -                          |
| 0.7082 | 4480 | 0.7789        | -                          |
| 0.7113 | 4500 | 0.8261        | -                          |
| 0.7145 | 4520 | 0.854         | -                          |
| 0.7177 | 4540 | 0.7974        | -                          |
| 0.7208 | 4560 | 0.8439        | -                          |
| 0.7240 | 4580 | 0.818         | -                          |
| 0.7272 | 4600 | 0.7888        | -                          |
| 0.7303 | 4620 | 0.8877        | -                          |
| 0.7335 | 4640 | 0.8232        | -                          |
| 0.7366 | 4660 | 0.7957        | -                          |
| 0.7398 | 4680 | 0.8362        | -                          |
| 0.7430 | 4700 | 0.753         | -                          |
| 0.7461 | 4720 | 0.9162        | -                          |
| 0.7493 | 4740 | 0.7765        | -                          |
| 0.7525 | 4760 | 0.7998        | -                          |
| 0.7556 | 4780 | 0.7864        | -                          |
| 0.7588 | 4800 | 0.8647        | -                          |
| 0.7619 | 4820 | 0.8372        | -                          |
| 0.7651 | 4840 | 0.8504        | -                          |
| 0.7683 | 4860 | 0.8386        | -                          |
| 0.7714 | 4880 | 0.7621        | -                          |
| 0.7746 | 4900 | 0.7734        | -                          |
| 0.7777 | 4920 | 0.9094        | -                          |
| 0.7809 | 4940 | 0.8194        | -                          |
| 0.7841 | 4960 | 0.8591        | -                          |
| 0.7872 | 4980 | 0.8015        | -                          |
| 0.7904 | 5000 | 0.8049        | -                          |
| 0.7936 | 5020 | 0.7673        | -                          |
| 0.7967 | 5040 | 0.8643        | -                          |
| 0.7999 | 5060 | 0.754         | -                          |
| 0.8030 | 5080 | 0.7749        | -                          |
| 0.8062 | 5100 | 0.7639        | -                          |
| 0.8094 | 5120 | 0.8           | -                          |
| 0.8125 | 5140 | 0.8069        | -                          |
| 0.8157 | 5160 | 0.7975        | -                          |
| 0.8188 | 5180 | 0.764         | -                          |
| 0.8220 | 5200 | 0.842         | -                          |
| 0.8252 | 5220 | 0.8787        | -                          |
| 0.8283 | 5240 | 0.8819        | -                          |
| 0.8315 | 5260 | 0.8338        | -                          |
| 0.8347 | 5280 | 0.8508        | -                          |
| 0.8378 | 5300 | 0.7691        | -                          |
| 0.8410 | 5320 | 0.8458        | -                          |
| 0.8441 | 5340 | 0.7787        | -                          |
| 0.8473 | 5360 | 0.8077        | -                          |
| 0.8505 | 5380 | 0.8098        | -                          |
| 0.8536 | 5400 | 0.8261        | -                          |
| 0.8568 | 5420 | 0.8268        | -                          |
| 0.8599 | 5440 | 0.8459        | -                          |
| 0.8631 | 5460 | 0.7852        | -                          |
| 0.8663 | 5480 | 0.7086        | -                          |
| 0.8694 | 5500 | 0.7699        | -                          |
| 0.8726 | 5520 | 0.8147        | -                          |
| 0.8758 | 5540 | 0.8246        | -                          |
| 0.8789 | 5560 | 0.7893        | -                          |
| 0.8821 | 5580 | 0.8332        | -                          |
| 0.8852 | 5600 | 0.7127        | -                          |
| 0.8884 | 5620 | 0.8047        | -                          |
| 0.8916 | 5640 | 0.8774        | -                          |
| 0.8947 | 5660 | 0.8672        | -                          |
| 0.8979 | 5680 | 0.8259        | -                          |
| 0.9010 | 5700 | 0.776         | -                          |
| 0.9042 | 5720 | 0.8961        | -                          |
| 0.9074 | 5740 | 0.8395        | -                          |
| 0.9105 | 5760 | 0.7953        | -                          |
| 0.9137 | 5780 | 0.8382        | -                          |
| 0.9169 | 5800 | 0.739         | -                          |
| 0.9200 | 5820 | 0.8259        | -                          |
| 0.9232 | 5840 | 0.8383        | -                          |
| 0.9263 | 5860 | 0.8368        | -                          |
| 0.9295 | 5880 | 0.8068        | -                          |
| 0.9327 | 5900 | 0.7138        | -                          |
| 0.9358 | 5920 | 0.8241        | -                          |
| 0.9390 | 5940 | 0.8653        | -                          |
| 0.9421 | 5960 | 0.8305        | -                          |
| 0.9453 | 5980 | 0.7918        | -                          |
| 0.9485 | 6000 | 0.8554        | 0.0998                     |
| 0.9516 | 6020 | 0.825         | -                          |
| 0.9548 | 6040 | 0.7564        | -                          |
| 0.9580 | 6060 | 0.8323        | -                          |
| 0.9611 | 6080 | 0.8104        | -                          |
| 0.9643 | 6100 | 0.7338        | -                          |
| 0.9674 | 6120 | 0.8568        | -                          |
| 0.9706 | 6140 | 0.8711        | -                          |
| 0.9738 | 6160 | 0.8166        | -                          |
| 0.9769 | 6180 | 0.7405        | -                          |
| 0.9801 | 6200 | 0.7245        | -                          |
| 0.9832 | 6220 | 0.7597        | -                          |
| 0.9864 | 6240 | 0.8384        | -                          |
| 0.9896 | 6260 | 0.8407        | -                          |
| 0.9927 | 6280 | 0.8376        | -                          |
| 0.9959 | 6300 | 0.8535        | -                          |
| 0.9991 | 6320 | 0.7716        | -                          |

</details>

### Framework Versions
- Python: 3.11.14
- Sentence Transformers: 5.1.1
- Transformers: 4.57.1
- PyTorch: 2.8.0+cu128
- Accelerate: 1.11.0
- Datasets: 4.2.0
- Tokenizers: 0.22.1

## Citation

### BibTeX

#### Sentence Transformers
```bibtex
@inproceedings{reimers-2019-sentence-bert,
    title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
    author = "Reimers, Nils and Gurevych, Iryna",
    booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
    month = "11",
    year = "2019",
    publisher = "Association for Computational Linguistics",
    url = "https://arxiv.org/abs/1908.10084",
}
```

#### FlopsLoss
```bibtex
@article{paria2020minimizing,
    title={Minimizing flops to learn efficient sparse representations},
    author={Paria, Biswajit and Yeh, Chih-Kuan and Yen, Ian EH and Xu, Ning and Ravikumar, Pradeep and P{'o}czos, Barnab{'a}s},
    journal={arXiv preprint arXiv:2004.05665},
    year={2020}
}
```

<!--
## Glossary

*Clearly define terms in order to be accessible across audiences.*
-->

<!--
## Model Card Authors

*Lists the people who create the model card, providing recognition and accountability for the detailed work that goes into its construction.*
-->

<!--
## Model Card Contact

*Provides a way for people who have updates to the Model Card, suggestions, or questions, to contact the Model Card authors.*
-->