bwang0911 commited on
Commit
390a981
·
verified ·
1 Parent(s): d5375f2

Add new SentenceTransformer model

Browse files
1_Pooling/config.json ADDED
@@ -0,0 +1,10 @@
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "word_embedding_dimension": 768,
3
+ "pooling_mode_cls_token": false,
4
+ "pooling_mode_mean_tokens": true,
5
+ "pooling_mode_max_tokens": false,
6
+ "pooling_mode_mean_sqrt_len_tokens": false,
7
+ "pooling_mode_weightedmean_tokens": false,
8
+ "pooling_mode_lasttoken": false,
9
+ "include_prompt": true
10
+ }
README.md ADDED
@@ -0,0 +1,937 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ tags:
3
+ - sentence-transformers
4
+ - sentence-similarity
5
+ - feature-extraction
6
+ - generated_from_trainer
7
+ - dataset_size:44978
8
+ - loss:ReasoningGuidedRankingLoss
9
+ base_model: google-bert/bert-base-uncased
10
+ widget:
11
+ - source_sentence: Severe weather rips through Alabama university, takes aim at Southeast
12
+ sentences:
13
+ - The second text provides a detailed elaboration of the first text. It expands
14
+ on the initial statement about severe weather in Alabama, providing specific details
15
+ about the damage at Jacksonville State University, the impact on the surrounding
16
+ areas, and the broader effects of the storm.
17
+ - 'The labor movement has been living in the shadow of a national assault on public-sector
18
+ collective bargaining for a while now. We’ve talked a lot about Harris v. Quinn,
19
+ how labor dodged a bullet with that case, and dodged another with the death of
20
+ Scalia before the Friedrichs case could be decided. But Janus v. American Federation
21
+ of State, County, and Municipal Employees, Council 31 is likely to be the case
22
+ labor has been dreading, and we break it down for you today with Andy Stettner
23
+ of the Century Foundation.
24
+
25
+ We also look at Uber’s failures in London and neoliberalism’s failures in France,
26
+ a union drive at the Los Angeles Times and a labor solidarity mission to Puerto
27
+ Rico post-hurricanes. For Argh, we consider forced labor “rehab” facilities, and
28
+ how moving left is the solution to the rise of the populist right.
29
+
30
+ If you think our work is worth supporting as we soldier on through Trumplandia,
31
+ please consider becoming a sustaining member of Belabored or donating or subscribing
32
+ to Dissent. Help keep us going for the next 136 episodes!'
33
+ - 'Severe weather that spawned at least one tornado slammed Alabama’s Jacksonville
34
+ State University on Monday night and took aim at the rest of the southeast.
35
+
36
+ Alabama state troopers said the damage in Jacksonville, Ala. left the city looking
37
+ like a “war zone.” Strong winds downed trees and damaged buildings as the National
38
+ Weather Service confirmed a “damaging and possibly large tornado near Jacksonville
39
+ and Calhoun counties and was moving east.
40
+
41
+ Jacksonville State University Athletic Director Greg Seitz wrote in a tweet that
42
+ there was significant damage to campus, including to the newly renovated Pete
43
+ Mathews Coliseum.
44
+
45
+ "I can confirm we have major roof damage at Pete Mathews Coliseum, but The Pete
46
+ is not completely destroyed," Seitz said in a tweet.
47
+
48
+ Tuscaloosa County Sheriff’s Office Lt. Andy Norris said in a tweet that troopers
49
+ called Jacksonville a “war zone.” He said the arena’s roof “took major damage.”
50
+
51
+ Photos seen on social media showed the extent of the damage Jacksonville took.
52
+
53
+ Alabama Gov. Kay Ivey confirmed in a statement late Monday there was “significant
54
+ damage” throughout the state, according to WBRC-TV.
55
+
56
+ Cities in northern Alabama reported power outages and the NWS in Huntsville reported
57
+ at least three tornadoes in the area.
58
+
59
+ The severe weather moved into Georgia late Monday night.
60
+
61
+ Flights at Hartsfield Airport in Atlanta were not officially grounded as the damaging
62
+ winds moved into the area. However, the airport warned on Twitter that delays
63
+ were likely.
64
+
65
+ Meanwhile, more than 150 people reportedly took cover into a historic cave in
66
+ Cave Springs, Ga.
67
+
68
+ The storms knocked out power to at least 15,000 homes and businesses in Alabama.
69
+ Georgia Power was rpeorting more than 26,000 of their customers were without power,
70
+ according to Cobb County News.
71
+
72
+ The Associated Press contributed to this report.'
73
+ - source_sentence: NCAA Sexual Violence Policy Criticized as Weak
74
+ sentences:
75
+ - The second text provides details that elaborate on the criticism mentioned in
76
+ the first text. It describes the NCAA's new rules and then presents a specific
77
+ critique, highlighting the perceived weaknesses in the policy, such as the lack
78
+ of strong enforcement and accountability, thus supporting the initial claim of
79
+ weakness.
80
+ - 'CHAMPAIGN -- Illinois had one final chance to finish this week on a recruiting
81
+ strong note. After missing out on three Class of 2018 forwards early in the week,
82
+ the Illini were still in the running for four-star Georgia prospect Landers Nolley.
83
+
84
+ Until Friday morning. Nolley, a 6-foot-7 wing who played his sophomore season
85
+ at Curie in Chicago before moving to Georgia, narrowed his choices to Georgia
86
+ and Virginia Tech.
87
+
88
+ Nolley''s almost final decision left Illinois 0 for 4 on 2018 targets this week
89
+ after Lukas Kisunas (UConn), George Conditt (Iowa State) and Colin Castleton (Michigan)
90
+ all committed elsewhere. That leaves the Illini in further pursuit of in-state
91
+ targets like Morgan Park''s Ayo Dosunmu, who will start an official visit at Illinois
92
+ on Oct. 13, and Simeon''s Talen Horton-Tucker.'
93
+ - 'The National Collegiate Athletic Association adopted rules last week that require
94
+ key administrators to complete annual training on sexual violence prevention,
95
+ and to certify annually that the institution''s teams and programs are familiar
96
+ with policies and processes to prevent sexual violence or to deal with incidents
97
+ that take place. Further, the rules require institutions to provide information
98
+ to athletes on institutional policies and procedures.
99
+
100
+ A column in The Huffington Post noted that the NCAA rules are largely similar
101
+ to what federal law requires of colleges, and that they don''t address issues
102
+ related to athletes found to have assaulted others. What the rules lack, the column
103
+ said, "is enforcement or accountability that approaches penalties reaching the
104
+ [same] level as the purchase of a hamburger for a student athlete."'
105
+ - source_sentence: William few Pkwy and Chamblin Rd new traffic signal - WFXG FOX
106
+ 54 - News Now
107
+ sentences:
108
+ - The second text elaborates on the first by providing details about the traffic
109
+ signal mentioned in the title. It specifies the location (William Few Parkway
110
+ and Chamblin Road) and the schedule for the signal's operation, including the
111
+ dates it will be in flashing and normal modes.
112
+ - 'Columbia County wants to inform the driving public of a new traffic signal installation.
113
+ It’s located at William Few Parkway and Chamblin Road.
114
+
115
+ The light is scheduled to go into flashing mode on Friday October 6th, 2017. The
116
+ signal will remain in flashing mode for the remainder of this week, including
117
+ the weekend. The signal is scheduled to be placed into normal stop and go operation
118
+ on Tuesday, October 10, 2017.
119
+
120
+ Copyright 2017 WFXG. All rights reserved.'
121
+ - 'NEWPORT BEACH, Calif. (AP) — The Latest on a fatal helicopter crash in Southern
122
+ California (all times local):
123
+
124
+ 10:07 a.m.
125
+
126
+ California authorities have released the name of all three people killed when
127
+ a small helicopter crashed in a Newport Beach neighborhood.
128
+
129
+ The Orange County Sheriff''s Department says the dead are 60-year-old Joseph Anthony
130
+ Tena of Newport Beach, 45-year-old Kimberly Lynne Watzman of Santa Monica and
131
+ 56-year-old Brian R. Reichelt of Hollywood.
132
+
133
+ The crash Wednesday in a neighborhood involved four people in the helicopter and
134
+ a bystander. Newport Beach police spokeswoman Jennifer Manzella says all three
135
+ people killed were in the helicopter.
136
+
137
+ There''s no information about two people who were injured.
138
+
139
+ ___
140
+
141
+ 11:03 p.m.
142
+
143
+ Officials say three people were killed and two more injured when a helicopter
144
+ crashed into a home in a suburban Southern California neighborhood.
145
+
146
+ Authorities say four people were aboard the Robinson R44 helicopter when it went
147
+ down in Newport Beach on Tuesday afternoon just a few minutes after taking off
148
+ from John Wayne Airport.
149
+
150
+ One person who was outside on the ground was involved in the crash, though officials
151
+ did not specify who died and who was injured.
152
+
153
+ Neighbor Marian Michaels says she thought it was an earthquake when the helicopter
154
+ slammed into the house.
155
+
156
+ Another neighbor, Roger Johnson, says he heard a scream that sounded like it was
157
+ from a horror movie before rushing to the scene to try to help.'
158
+ - source_sentence: 'Former AG, ex-Jordanian PM top contenders for Pak''s ICJ ad-hoc
159
+ judge choice: report'
160
+ sentences:
161
+ - The second text elaborates on the first by providing details about the contenders
162
+ for the ad-hoc judge position. It names specific individuals (ex-AG and former
163
+ Jordanian PM) and provides context about the case at the ICJ, the nomination process,
164
+ and the sources of the information. The report confirms the information presented
165
+ in the title.
166
+ - 'Image caption The last confirmed sighting of Brian McGowan was in Plean on 21
167
+ September
168
+
169
+ Police searching for a man who has not been seen for more than two weeks are asking
170
+ the public to check outbuildings and gardens for any trace of him.
171
+
172
+ Brian McGowan, 42, was last seen in the Gillespie Terrace area of Plean, near
173
+ Stirling, at 16:00 on 21 September.
174
+
175
+ Investigations have uncovered a "probable" sighting of him in the Gallamuir Drive
176
+ area at 01:30 the following day.
177
+
178
+ Police said that since then he has not returned home or contacted anyone.
179
+
180
+ Insp Donna Bryans said: "Brian has now been missing for two weeks and it is vital
181
+ that we find him.
182
+
183
+ "I would like to thank the local community who have come out to search for Brian
184
+ and helped with our investigations so far.
185
+
186
+ "I would ask residents and visitors to Plean, as well as visitors to Plean Country
187
+ Park, to be vigilant and report any sighting of anyone seen matching Brian''s
188
+ description."
189
+
190
+ Insp Bryans said a search of gardens and outbuildings in the area could help officers
191
+ discover Mr McGowan''s whereabouts.
192
+
193
+ He is described as 5ft 10 tall, of slim build with short dark hair. He had blue
194
+ eyes and tattoos on his fingers and speaks with a local accent.
195
+
196
+ When last seen he was wearing a black baseball cap, a black G-Star jacket, grey
197
+ Armani jumper, grey Adidas tracksuit bottoms with black stripes on the sides and
198
+ black and grey Adidas Y3 trainers.'
199
+ - 'ISLAMABAD: The Pakistan government has begun consultations over the nomination
200
+ of an ad-hoc judge for the Kulbhushan Jadhav case being heard at the International
201
+ Court of Justice with an ex-attorney general and a former Jordanian premier emerging
202
+ as the top contenders, a media report said today. India had moved the Hague-based
203
+ International Court of Justice (ICJ) against Jadhav''s death penalty handed down
204
+ by a Pakistani military court. The ICJ had on May 18 restrained Pakistan from
205
+ executing the death sentence.Pakistan government''s functionaries have started
206
+ consultations for the nomination of an ad-hoc judge, Express Tribune reported,
207
+ citing sources.During the tenure of ousted prime minister Nawaz Sharif , former
208
+ Supreme Court judge Khalilur Rehman Ramday was approached, but he declined the
209
+ nomination, the report said.Sources were quoted by the daily as saying that the
210
+ Attorney General for Pakistan''s (AGP) office has recommended the names of senior
211
+ lawyer Makhdoom Ali Khan and former Jordanian prime minister Awn Shawkat Al-Khasawneh
212
+ to the Prime Minister''s Office for the nomination of one name as an ad-hoc judge.Khasawneh
213
+ served as an ICJ judge for over a decade, while Khan, a former Attorney General
214
+ who is seen as the favourite for the job, also has experience in international
215
+ arbitration cases, having represented eight different countries in international
216
+ courts.The nomination of the ad-hoc judge will be finalised after getting inputs
217
+ from the Foreign Office and the military establishment, the sources said, adding
218
+ that earlier, government functionaries had also considered the name of former
219
+ chief justice of Pakistan Tassaduq Hussain Jillani.An official was quoted as saying
220
+ that the name of the ad- hoc judge will be finalised next month, soon after the
221
+ Indian side files its documents.Meanwhile, Pakistan Bar Council (PBC) representative
222
+ Raheel Kamran Sheikh has called upon the government to seek Parliament''s approval
223
+ on the appointment of the ad-hoc judge.Only one person has previously been appointed
224
+ as ICJ judge in Pakistan''s history -- former foreign minister Zafarullah Khan,
225
+ who was appointed in 1954 and later became the president of the court.Yaqub Ali
226
+ Khan and Sharifuddin Pirzada both served as ad-hoc judges, as did Zafarullah.'
227
+ - source_sentence: Energy advocates call for new commitment to renewable growth
228
+ sentences:
229
+ - The second text elaborates on the first by providing details about the specific
230
+ context of the energy advocates' call for renewable growth. It identifies the
231
+ advocates (CFE, VoteSolar, Environment Connecticut), the specific renewable energy
232
+ program (community solar), and the reasons for their call, including program delays
233
+ and design flaws.
234
+ - 'The piece below was submitted by CFE, VoteSolar, and Environment Connecticut
235
+ in response to the latest delay in the shared solar pilot program.
236
+
237
+ Solar and environmental advocates are calling for a new community solar program
238
+ in Connecticut that will expand solar access, energy choices and consumer savings
239
+ for families, municipalities, and businesses statewide. The demand follows today’s
240
+ Department of Energy and Environmental Protection (DEEP) technical hearing where
241
+ attendees reviewed the state’s current Shared Clean Energy Facilities pilot program.
242
+ The pilot has stalled several times over the last two years, most recently following
243
+ DEEP’s decision to scrap all the proposals they have received and issue a new
244
+ request for projects. DEEP heard from many advocates and developers at the hearing
245
+ who are frustrated with this latest delay and skeptical about the long term success
246
+ of the pilot.
247
+
248
+ The current pilot program was meant to expand solar access to Connecticut energy
249
+ customers who can’t put solar on their own roof, but it contained flaws that have
250
+ prevented any development to date. As set out in the legislation, the program
251
+ has several poor design elements and a goal too small to draw significant private
252
+ sector interest. Below are statements from stakeholders in Connecticut’s clean
253
+ energy economy:
254
+
255
+ “For years, Connecticut has missed out on the opportunity to bring solar energy
256
+ choices to all consumers and more clean energy jobs to the state,” said Sean Garren,
257
+ Northeast Regional director for Vote Solar. “Connecticut’s lackluster community
258
+ solar program hasn’t unlocked the benefits of solar access for a single resident
259
+ to date due to poor design and a lack of ambition at the scale needed, brought
260
+ about by the electric utilities’ intervention. We’re calling on the legislature
261
+ to catch up to the rest of New England — and the nation — with a smart, well-structured
262
+ community solar program designed to serve consumers statewide.”
263
+
264
+ “Two years of foot dragging and refusal by the Department of Energy and Environmental
265
+ Protection to follow the law and implement a community solar program is preventing
266
+ tens of thousands of Connecticut families from gaining access to clean, affordable,
267
+ secure solar power,” said Chris Phelps, State Director for Environment Connecticut.
268
+ “Community solar is helping other states accelerate solar growth, create jobs,
269
+ and cut pollution. Connecticut policy makers should take action now to create
270
+ a bold community solar program.”
271
+
272
+ “Shared solar programs have been sweeping the nation for the last decade, but
273
+ Connecticut has been left in the shade — losing out on healthier air, investment
274
+ dollars, and green jobs that would accompany a full-scale, statewide shared solar
275
+ program,” said Claire Coleman, Climate and Energy Attorney for Connecticut Fund
276
+ for the Environment. “DEEP’s decision to start over with the already overly-restrictive
277
+ shared solar pilot puts Connecticut further in the dark. Our climate and economy
278
+ cannot wait any longer. Connecticut’s leaders must move quickly to ramp up in-state
279
+ renewables through a full-scale shared solar program if Connecticut is going to
280
+ have any chance of meeting its obligations under the Global Warming Solutions
281
+ Act to reduce greenhouse gas emissions.”
282
+
283
+ Vote Solar is a nonprofit organization working to foster economic development
284
+ and energy independence by bringing solar energy to the mainstream nationwide.
285
+ Learn more at votesolar.org.'
286
+ - 'BEIJING: China will waive income tax for three years for foreign investors trading
287
+ the country’s new crude futures contract, the Ministry of Finance said on Tuesday,
288
+ in a bid to attract overseas capital for the much anticipated launch.
289
+
290
+ The start of trading on Monday will mark the culmination of a years-long push
291
+ by China to create Asia’s first oil futures benchmark, and is aimed at giving
292
+ the world’s biggest oil importer more clout in pricing crude sold to Asia.
293
+
294
+ It will potentially give the Shanghai International Energy Exchange, which will
295
+ operate the new contract, a share of the trillions of dollars each year in oil
296
+ futures trading.
297
+
298
+ The finance ministry said foreign brokers will be exempted from paying income
299
+ tax on commissions they earn from dealing in the new Shanghai crude futures.
300
+
301
+ The tax exemption could help encourage foreign players to engage with the new
302
+ contract, despite concerns about issues such as foreign exchange conversion and
303
+ potential capital curbs.
304
+
305
+ The number of foreign investors seeking to open non-resident accounts to allow
306
+ trading has so far been below expectations, a source at CITIC, one of eight banks
307
+ that is handling margin deposits for foreign investors, said. The source declined
308
+ to be named as he is not authorized to talk with media.
309
+
310
+ The oil market is closely watching the liquidity of the contract, as institutional
311
+ investors and brokers expect trading volumes and open interest to be relatively
312
+ small compared with China’s iron ore, copper and steel futures contracts.
313
+
314
+ China in recent days has provided more details on the contract, including margins,
315
+ trading limits and transaction fees, and has approved the use of six bonded storage
316
+ warehouses.'
317
+ datasets:
318
+ - bwang0911/reasoning_pairs_filtered_w_reason_ccnews
319
+ pipeline_tag: sentence-similarity
320
+ library_name: sentence-transformers
321
+ metrics:
322
+ - cosine_accuracy@1
323
+ - cosine_accuracy@3
324
+ - cosine_accuracy@5
325
+ - cosine_accuracy@10
326
+ - cosine_precision@1
327
+ - cosine_precision@3
328
+ - cosine_precision@5
329
+ - cosine_precision@10
330
+ - cosine_recall@1
331
+ - cosine_recall@3
332
+ - cosine_recall@5
333
+ - cosine_recall@10
334
+ - cosine_ndcg@10
335
+ - cosine_mrr@10
336
+ - cosine_map@100
337
+ model-index:
338
+ - name: SentenceTransformer based on google-bert/bert-base-uncased
339
+ results:
340
+ - task:
341
+ type: information-retrieval
342
+ name: Information Retrieval
343
+ dataset:
344
+ name: mteb/nfcorpus
345
+ type: mteb/nfcorpus
346
+ metrics:
347
+ - type: cosine_accuracy@1
348
+ value: 0.3126934984520124
349
+ name: Cosine Accuracy@1
350
+ - type: cosine_accuracy@3
351
+ value: 0.47678018575851394
352
+ name: Cosine Accuracy@3
353
+ - type: cosine_accuracy@5
354
+ value: 0.5325077399380805
355
+ name: Cosine Accuracy@5
356
+ - type: cosine_accuracy@10
357
+ value: 0.5975232198142415
358
+ name: Cosine Accuracy@10
359
+ - type: cosine_precision@1
360
+ value: 0.3126934984520124
361
+ name: Cosine Precision@1
362
+ - type: cosine_precision@3
363
+ value: 0.2549019607843137
364
+ name: Cosine Precision@3
365
+ - type: cosine_precision@5
366
+ value: 0.20990712074303408
367
+ name: Cosine Precision@5
368
+ - type: cosine_precision@10
369
+ value: 0.16563467492260062
370
+ name: Cosine Precision@10
371
+ - type: cosine_recall@1
372
+ value: 0.03117827434222373
373
+ name: Cosine Recall@1
374
+ - type: cosine_recall@3
375
+ value: 0.05624265377613812
376
+ name: Cosine Recall@3
377
+ - type: cosine_recall@5
378
+ value: 0.06877168791903203
379
+ name: Cosine Recall@5
380
+ - type: cosine_recall@10
381
+ value: 0.09700903168215257
382
+ name: Cosine Recall@10
383
+ - type: cosine_ndcg@10
384
+ value: 0.21852791504742514
385
+ name: Cosine Ndcg@10
386
+ - type: cosine_mrr@10
387
+ value: 0.40163890117450485
388
+ name: Cosine Mrr@10
389
+ - type: cosine_map@100
390
+ value: 0.08949558554054256
391
+ name: Cosine Map@100
392
+ - task:
393
+ type: information-retrieval
394
+ name: Information Retrieval
395
+ dataset:
396
+ name: mteb/trec covid
397
+ type: mteb/trec-covid
398
+ metrics:
399
+ - type: cosine_accuracy@1
400
+ value: 0.62
401
+ name: Cosine Accuracy@1
402
+ - type: cosine_accuracy@3
403
+ value: 0.82
404
+ name: Cosine Accuracy@3
405
+ - type: cosine_accuracy@5
406
+ value: 0.92
407
+ name: Cosine Accuracy@5
408
+ - type: cosine_accuracy@10
409
+ value: 0.94
410
+ name: Cosine Accuracy@10
411
+ - type: cosine_precision@1
412
+ value: 0.62
413
+ name: Cosine Precision@1
414
+ - type: cosine_precision@3
415
+ value: 0.5599999999999999
416
+ name: Cosine Precision@3
417
+ - type: cosine_precision@5
418
+ value: 0.5519999999999999
419
+ name: Cosine Precision@5
420
+ - type: cosine_precision@10
421
+ value: 0.512
422
+ name: Cosine Precision@10
423
+ - type: cosine_recall@1
424
+ value: 0.0005213598128605203
425
+ name: Cosine Recall@1
426
+ - type: cosine_recall@3
427
+ value: 0.0014060584814840184
428
+ name: Cosine Recall@3
429
+ - type: cosine_recall@5
430
+ value: 0.0023515414225962748
431
+ name: Cosine Recall@5
432
+ - type: cosine_recall@10
433
+ value: 0.004357324560804962
434
+ name: Cosine Recall@10
435
+ - type: cosine_ndcg@10
436
+ value: 0.5323227421340048
437
+ name: Cosine Ndcg@10
438
+ - type: cosine_mrr@10
439
+ value: 0.7306666666666668
440
+ name: Cosine Mrr@10
441
+ - type: cosine_map@100
442
+ value: 0.22987991064708832
443
+ name: Cosine Map@100
444
+ - task:
445
+ type: information-retrieval
446
+ name: Information Retrieval
447
+ dataset:
448
+ name: mteb/fiqa
449
+ type: mteb/fiqa
450
+ metrics:
451
+ - type: cosine_accuracy@1
452
+ value: 0.13734567901234568
453
+ name: Cosine Accuracy@1
454
+ - type: cosine_accuracy@3
455
+ value: 0.22839506172839505
456
+ name: Cosine Accuracy@3
457
+ - type: cosine_accuracy@5
458
+ value: 0.2700617283950617
459
+ name: Cosine Accuracy@5
460
+ - type: cosine_accuracy@10
461
+ value: 0.345679012345679
462
+ name: Cosine Accuracy@10
463
+ - type: cosine_precision@1
464
+ value: 0.13734567901234568
465
+ name: Cosine Precision@1
466
+ - type: cosine_precision@3
467
+ value: 0.09310699588477366
468
+ name: Cosine Precision@3
469
+ - type: cosine_precision@5
470
+ value: 0.06944444444444445
471
+ name: Cosine Precision@5
472
+ - type: cosine_precision@10
473
+ value: 0.04645061728395062
474
+ name: Cosine Precision@10
475
+ - type: cosine_recall@1
476
+ value: 0.0697683960415442
477
+ name: Cosine Recall@1
478
+ - type: cosine_recall@3
479
+ value: 0.12649965346724604
480
+ name: Cosine Recall@3
481
+ - type: cosine_recall@5
482
+ value: 0.15659102129009536
483
+ name: Cosine Recall@5
484
+ - type: cosine_recall@10
485
+ value: 0.19997600136489024
486
+ name: Cosine Recall@10
487
+ - type: cosine_ndcg@10
488
+ value: 0.15747637847224993
489
+ name: Cosine Ndcg@10
490
+ - type: cosine_mrr@10
491
+ value: 0.19570105820105824
492
+ name: Cosine Mrr@10
493
+ - type: cosine_map@100
494
+ value: 0.12811920879354669
495
+ name: Cosine Map@100
496
+ - task:
497
+ type: information-retrieval
498
+ name: Information Retrieval
499
+ dataset:
500
+ name: mteb/quora
501
+ type: mteb/quora
502
+ metrics:
503
+ - type: cosine_accuracy@1
504
+ value: 0.7256
505
+ name: Cosine Accuracy@1
506
+ - type: cosine_accuracy@3
507
+ value: 0.8531
508
+ name: Cosine Accuracy@3
509
+ - type: cosine_accuracy@5
510
+ value: 0.8898
511
+ name: Cosine Accuracy@5
512
+ - type: cosine_accuracy@10
513
+ value: 0.9263
514
+ name: Cosine Accuracy@10
515
+ - type: cosine_precision@1
516
+ value: 0.7256
517
+ name: Cosine Precision@1
518
+ - type: cosine_precision@3
519
+ value: 0.33316666666666667
520
+ name: Cosine Precision@3
521
+ - type: cosine_precision@5
522
+ value: 0.21984
523
+ name: Cosine Precision@5
524
+ - type: cosine_precision@10
525
+ value: 0.12146000000000004
526
+ name: Cosine Precision@10
527
+ - type: cosine_recall@1
528
+ value: 0.6303186330948595
529
+ name: Cosine Recall@1
530
+ - type: cosine_recall@3
531
+ value: 0.7900249099696033
532
+ name: Cosine Recall@3
533
+ - type: cosine_recall@5
534
+ value: 0.838050682910748
535
+ name: Cosine Recall@5
536
+ - type: cosine_recall@10
537
+ value: 0.887497633693034
538
+ name: Cosine Recall@10
539
+ - type: cosine_ndcg@10
540
+ value: 0.8013139502721578
541
+ name: Cosine Ndcg@10
542
+ - type: cosine_mrr@10
543
+ value: 0.7959599603174561
544
+ name: Cosine Mrr@10
545
+ - type: cosine_map@100
546
+ value: 0.764750227681921
547
+ name: Cosine Map@100
548
+ ---
549
+
550
+ # SentenceTransformer based on google-bert/bert-base-uncased
551
+
552
+ This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [google-bert/bert-base-uncased](https://huggingface.co/google-bert/bert-base-uncased) on the [reason_unfiltered](https://huggingface.co/datasets/bwang0911/reasoning_pairs_filtered_w_reason_ccnews) dataset. It maps sentences & paragraphs to a 768-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
553
+
554
+ ## Model Details
555
+
556
+ ### Model Description
557
+ - **Model Type:** Sentence Transformer
558
+ - **Base model:** [google-bert/bert-base-uncased](https://huggingface.co/google-bert/bert-base-uncased) <!-- at revision 86b5e0934494bd15c9632b12f734a8a67f723594 -->
559
+ - **Maximum Sequence Length:** 196 tokens
560
+ - **Output Dimensionality:** 768 dimensions
561
+ - **Similarity Function:** Cosine Similarity
562
+ - **Training Dataset:**
563
+ - [reason_unfiltered](https://huggingface.co/datasets/bwang0911/reasoning_pairs_filtered_w_reason_ccnews)
564
+ <!-- - **Language:** Unknown -->
565
+ <!-- - **License:** Unknown -->
566
+
567
+ ### Model Sources
568
+
569
+ - **Documentation:** [Sentence Transformers Documentation](https://sbert.net)
570
+ - **Repository:** [Sentence Transformers on GitHub](https://github.com/UKPLab/sentence-transformers)
571
+ - **Hugging Face:** [Sentence Transformers on Hugging Face](https://huggingface.co/models?library=sentence-transformers)
572
+
573
+ ### Full Model Architecture
574
+
575
+ ```
576
+ SentenceTransformer(
577
+ (0): Transformer({'max_seq_length': 196, 'do_lower_case': False}) with Transformer model: BertModel
578
+ (1): Pooling({'word_embedding_dimension': 768, 'pooling_mode_cls_token': False, 'pooling_mode_mean_tokens': True, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
579
+ )
580
+ ```
581
+
582
+ ## Usage
583
+
584
+ ### Direct Usage (Sentence Transformers)
585
+
586
+ First install the Sentence Transformers library:
587
+
588
+ ```bash
589
+ pip install -U sentence-transformers
590
+ ```
591
+
592
+ Then you can load this model and run inference.
593
+ ```python
594
+ from sentence_transformers import SentenceTransformer
595
+
596
+ # Download from the 🤗 Hub
597
+ model = SentenceTransformer("bwang0911/reasoning-bert-ccnews")
598
+ # Run inference
599
+ sentences = [
600
+ 'Energy advocates call for new commitment to renewable growth',
601
+ 'The piece below was submitted by CFE, VoteSolar, and Environment Connecticut in response to the latest delay in the shared solar pilot program.\nSolar and environmental advocates are calling for a new community solar program in Connecticut that will expand solar access, energy choices and consumer savings for families, municipalities, and businesses statewide. The demand follows today’s Department of Energy and Environmental Protection (DEEP) technical hearing where attendees reviewed the state’s current Shared Clean Energy Facilities pilot program. The pilot has stalled several times over the last two years, most recently following DEEP’s decision to scrap all the proposals they have received and issue a new request for projects. DEEP heard from many advocates and developers at the hearing who are frustrated with this latest delay and skeptical about the long term success of the pilot.\nThe current pilot program was meant to expand solar access to Connecticut energy customers who can’t put solar on their own roof, but it contained flaws that have prevented any development to date. As set out in the legislation, the program has several poor design elements and a goal too small to draw significant private sector interest. Below are statements from stakeholders in Connecticut’s clean energy economy:\n“For years, Connecticut has missed out on the opportunity to bring solar energy choices to all consumers and more clean energy jobs to the state,” said Sean Garren, Northeast Regional director for Vote Solar. “Connecticut’s lackluster community solar program hasn’t unlocked the benefits of solar access for a single resident to date due to poor design and a lack of ambition at the scale needed, brought about by the electric utilities’ intervention. We’re calling on the legislature to catch up to the rest of New England — and the nation — with a smart, well-structured community solar program designed to serve consumers statewide.”\n“Two years of foot dragging and refusal by the Department of Energy and Environmental Protection to follow the law and implement a community solar program is preventing tens of thousands of Connecticut families from gaining access to clean, affordable, secure solar power,” said Chris Phelps, State Director for Environment Connecticut. “Community solar is helping other states accelerate solar growth, create jobs, and cut pollution. Connecticut policy makers should take action now to create a bold community solar program.”\n“Shared solar programs have been sweeping the nation for the last decade, but Connecticut has been left in the shade — losing out on healthier air, investment dollars, and green jobs that would accompany a full-scale, statewide shared solar program,” said Claire Coleman, Climate and Energy Attorney for Connecticut Fund for the Environment. “DEEP’s decision to start over with the already overly-restrictive shared solar pilot puts Connecticut further in the dark. Our climate and economy cannot wait any longer. Connecticut’s leaders must move quickly to ramp up in-state renewables through a full-scale shared solar program if Connecticut is going to have any chance of meeting its obligations under the Global Warming Solutions Act to reduce greenhouse gas emissions.”\nVote Solar is a nonprofit organization working to foster economic development and energy independence by bringing solar energy to the mainstream nationwide. Learn more at votesolar.org.',
602
+ "The second text elaborates on the first by providing details about the specific context of the energy advocates' call for renewable growth. It identifies the advocates (CFE, VoteSolar, Environment Connecticut), the specific renewable energy program (community solar), and the reasons for their call, including program delays and design flaws.",
603
+ ]
604
+ embeddings = model.encode(sentences)
605
+ print(embeddings.shape)
606
+ # [3, 768]
607
+
608
+ # Get the similarity scores for the embeddings
609
+ similarities = model.similarity(embeddings, embeddings)
610
+ print(similarities.shape)
611
+ # [3, 3]
612
+ ```
613
+
614
+ <!--
615
+ ### Direct Usage (Transformers)
616
+
617
+ <details><summary>Click to see the direct usage in Transformers</summary>
618
+
619
+ </details>
620
+ -->
621
+
622
+ <!--
623
+ ### Downstream Usage (Sentence Transformers)
624
+
625
+ You can finetune this model on your own dataset.
626
+
627
+ <details><summary>Click to expand</summary>
628
+
629
+ </details>
630
+ -->
631
+
632
+ <!--
633
+ ### Out-of-Scope Use
634
+
635
+ *List how the model may foreseeably be misused and address what users ought not to do with the model.*
636
+ -->
637
+
638
+ ## Evaluation
639
+
640
+ ### Metrics
641
+
642
+ #### Information Retrieval
643
+
644
+ * Datasets: `mteb/nfcorpus`, `mteb/trec-covid`, `mteb/fiqa` and `mteb/quora`
645
+ * Evaluated with [<code>InformationRetrievalEvaluator</code>](https://sbert.net/docs/package_reference/sentence_transformer/evaluation.html#sentence_transformers.evaluation.InformationRetrievalEvaluator)
646
+
647
+ | Metric | mteb/nfcorpus | mteb/trec-covid | mteb/fiqa | mteb/quora |
648
+ |:--------------------|:--------------|:----------------|:-----------|:-----------|
649
+ | cosine_accuracy@1 | 0.3127 | 0.62 | 0.1373 | 0.7256 |
650
+ | cosine_accuracy@3 | 0.4768 | 0.82 | 0.2284 | 0.8531 |
651
+ | cosine_accuracy@5 | 0.5325 | 0.92 | 0.2701 | 0.8898 |
652
+ | cosine_accuracy@10 | 0.5975 | 0.94 | 0.3457 | 0.9263 |
653
+ | cosine_precision@1 | 0.3127 | 0.62 | 0.1373 | 0.7256 |
654
+ | cosine_precision@3 | 0.2549 | 0.56 | 0.0931 | 0.3332 |
655
+ | cosine_precision@5 | 0.2099 | 0.552 | 0.0694 | 0.2198 |
656
+ | cosine_precision@10 | 0.1656 | 0.512 | 0.0465 | 0.1215 |
657
+ | cosine_recall@1 | 0.0312 | 0.0005 | 0.0698 | 0.6303 |
658
+ | cosine_recall@3 | 0.0562 | 0.0014 | 0.1265 | 0.79 |
659
+ | cosine_recall@5 | 0.0688 | 0.0024 | 0.1566 | 0.8381 |
660
+ | cosine_recall@10 | 0.097 | 0.0044 | 0.2 | 0.8875 |
661
+ | **cosine_ndcg@10** | **0.2185** | **0.5323** | **0.1575** | **0.8013** |
662
+ | cosine_mrr@10 | 0.4016 | 0.7307 | 0.1957 | 0.796 |
663
+ | cosine_map@100 | 0.0895 | 0.2299 | 0.1281 | 0.7648 |
664
+
665
+ <!--
666
+ ## Bias, Risks and Limitations
667
+
668
+ *What are the known or foreseeable issues stemming from this model? You could also flag here known failure cases or weaknesses of the model.*
669
+ -->
670
+
671
+ <!--
672
+ ### Recommendations
673
+
674
+ *What are recommendations with respect to the foreseeable issues? For example, filtering explicit content.*
675
+ -->
676
+
677
+ ## Training Details
678
+
679
+ ### Training Dataset
680
+
681
+ #### reason_unfiltered
682
+
683
+ * Dataset: [reason_unfiltered](https://huggingface.co/datasets/bwang0911/reasoning_pairs_filtered_w_reason_ccnews) at [2e4fb05](https://huggingface.co/datasets/bwang0911/reasoning_pairs_filtered_w_reason_ccnews/tree/2e4fb0585e862af0623b97b64d34325001b218a2)
684
+ * Size: 44,978 training samples
685
+ * Columns: <code>title</code>, <code>body</code>, and <code>reason</code>
686
+ * Approximate statistics based on the first 1000 samples:
687
+ | | title | body | reason |
688
+ |:--------|:----------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------|:-----------------------------------------------------------------------------------|
689
+ | type | string | string | string |
690
+ | details | <ul><li>min: 6 tokens</li><li>mean: 15.34 tokens</li><li>max: 42 tokens</li></ul> | <ul><li>min: 21 tokens</li><li>mean: 178.04 tokens</li><li>max: 196 tokens</li></ul> | <ul><li>min: 28 tokens</li><li>mean: 59.19 tokens</li><li>max: 88 tokens</li></ul> |
691
+ * Samples:
692
+ | title | body | reason |
693
+ |:----------------------------------------------------------------------------------|:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
694
+ | <code>Fight Leaves Wayne Simmonds Shirtless</code> | <code>Reed Saxon/AP Images<br>Kevin Bieksa and Wayne Simmonds dropped the gloves just 95 seconds into last night’s 4-3 Ducks shootout win over the Flyers, and Bieksa immediately yanked his opponent’s jersey over his head, to the delight of the crowd and to grins from Simmonds and the officials.<br>That’s not supposed to happen. NHL players wear something called a fight strap, which binds the back of the jersey to the pants, preventing the jersey from being pulled off. (Losing a jersey is an advantage in a fight, as it gives the shirtless player’s opponent nothing to grab on to. Sabres enforcer Rob Ray was notorious for losing his gear in a fight, occasionally taking it off himself before clinching.) Any player who engaged in a fight without wearing a fight strap is subject to an automatic game misconduct.<br>Advertisement<br>Simmonds wasn’t ejected, though; at the one-minute mark of the video above, you can see he did have his fight strap properly attached. It just broke, which happens on occasion.</code> | <code>The article describes a hockey fight involving Wayne Simmonds, confirming the title's claim. It details the fight, including Simmonds' jersey being pulled off, and explains the rules and context around the incident, directly elaborating on the event suggested by the title.</code> |
695
+ | <code>Merck CEO Kenneth Frazier ditches Trump over Charlottesville silence</code> | <code>Merck CEO Kenneth C. Frazier resigned from the president’s council on manufacturing Monday in direct protest of President Donald Trump’s lack of condemnation of white nationalist actions in Charlottesville, Va. over the weekend.<br>In a statement, Frazier, who is African-American, said he believes the country’s strength comes from the diversity of its citizens and that he feels personally compelled to stand up for that diversity and against intolerance.<br>“America’s leaders must honor our fundamental values by clearly rejecting expressions of hatred, bigotry and group supremacy, which run counter to the American ideal that all people are created equal,” he wrote. “As CEO of Merck, and as a matter of personal conscience, I feel a responsibility to take a stand against intolerance and extremism.”<br>RELATED: At least one death has been confirmed after a car plowed into a crowd of protesters in Charlottesville<br>Trump immediately fired back at Frazier on Twitter, saying the Merck CEO now “will have...</code> | <code>The second text provides a detailed elaboration of the first. It explains the context of Kenneth Frazier's resignation, the reasons behind it (Trump's silence on Charlottesville), and includes Frazier's statement. It also provides additional background information about Frazier and the President's Manufacturing Council.</code> |
696
+ | <code>Lightning's Braydon Coburn: Joining road trip</code> | <code>Coburn (lower body) will travel with the team on its upcoming four-game road trip and is hoping to play at some point in the second half of the trip, Bryan Burns of the Lightning's official site reports.<br>The veteran blueliner is yet to play in the month of December, having already missed four games. However, the fact that Coburn is traveling with the team and has been given a chance to play at some point within the next week will be music to the ears of fantasy owners who benefited from Coburn's surprising production -- seven points in 25 games -- earlier in the season. Keep an eye out for updates as the trip progresses.</code> | <code>The second text elaborates on the first by providing details about Braydon Coburn's situation. It specifies that he will join the team on a road trip and offers context about his injury, recovery timeline, and potential for playing, directly expanding on the initial announcement.</code> |
697
+ * Loss: [<code>ReasoningGuidedRankingLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#reasoningguidedrankingloss) with these parameters:
698
+ ```json
699
+ {
700
+ "scale": 20.0,
701
+ "similarity_fct": "cos_sim"
702
+ }
703
+ ```
704
+
705
+ ### Training Hyperparameters
706
+ #### Non-Default Hyperparameters
707
+
708
+ - `eval_strategy`: steps
709
+ - `per_device_train_batch_size`: 256
710
+ - `learning_rate`: 1e-05
711
+ - `warmup_ratio`: 0.1
712
+ - `fp16`: True
713
+ - `batch_sampler`: no_duplicates
714
+
715
+ #### All Hyperparameters
716
+ <details><summary>Click to expand</summary>
717
+
718
+ - `overwrite_output_dir`: False
719
+ - `do_predict`: False
720
+ - `eval_strategy`: steps
721
+ - `prediction_loss_only`: True
722
+ - `per_device_train_batch_size`: 256
723
+ - `per_device_eval_batch_size`: 8
724
+ - `per_gpu_train_batch_size`: None
725
+ - `per_gpu_eval_batch_size`: None
726
+ - `gradient_accumulation_steps`: 1
727
+ - `eval_accumulation_steps`: None
728
+ - `torch_empty_cache_steps`: None
729
+ - `learning_rate`: 1e-05
730
+ - `weight_decay`: 0.0
731
+ - `adam_beta1`: 0.9
732
+ - `adam_beta2`: 0.999
733
+ - `adam_epsilon`: 1e-08
734
+ - `max_grad_norm`: 1.0
735
+ - `num_train_epochs`: 3
736
+ - `max_steps`: -1
737
+ - `lr_scheduler_type`: linear
738
+ - `lr_scheduler_kwargs`: {}
739
+ - `warmup_ratio`: 0.1
740
+ - `warmup_steps`: 0
741
+ - `log_level`: passive
742
+ - `log_level_replica`: warning
743
+ - `log_on_each_node`: True
744
+ - `logging_nan_inf_filter`: True
745
+ - `save_safetensors`: True
746
+ - `save_on_each_node`: False
747
+ - `save_only_model`: False
748
+ - `restore_callback_states_from_checkpoint`: False
749
+ - `no_cuda`: False
750
+ - `use_cpu`: False
751
+ - `use_mps_device`: False
752
+ - `seed`: 42
753
+ - `data_seed`: None
754
+ - `jit_mode_eval`: False
755
+ - `use_ipex`: False
756
+ - `bf16`: False
757
+ - `fp16`: True
758
+ - `fp16_opt_level`: O1
759
+ - `half_precision_backend`: auto
760
+ - `bf16_full_eval`: False
761
+ - `fp16_full_eval`: False
762
+ - `tf32`: None
763
+ - `local_rank`: 0
764
+ - `ddp_backend`: None
765
+ - `tpu_num_cores`: None
766
+ - `tpu_metrics_debug`: False
767
+ - `debug`: []
768
+ - `dataloader_drop_last`: False
769
+ - `dataloader_num_workers`: 0
770
+ - `dataloader_prefetch_factor`: None
771
+ - `past_index`: -1
772
+ - `disable_tqdm`: False
773
+ - `remove_unused_columns`: True
774
+ - `label_names`: None
775
+ - `load_best_model_at_end`: False
776
+ - `ignore_data_skip`: False
777
+ - `fsdp`: []
778
+ - `fsdp_min_num_params`: 0
779
+ - `fsdp_config`: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}
780
+ - `tp_size`: 0
781
+ - `fsdp_transformer_layer_cls_to_wrap`: None
782
+ - `accelerator_config`: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}
783
+ - `deepspeed`: None
784
+ - `label_smoothing_factor`: 0.0
785
+ - `optim`: adamw_torch
786
+ - `optim_args`: None
787
+ - `adafactor`: False
788
+ - `group_by_length`: False
789
+ - `length_column_name`: length
790
+ - `ddp_find_unused_parameters`: None
791
+ - `ddp_bucket_cap_mb`: None
792
+ - `ddp_broadcast_buffers`: False
793
+ - `dataloader_pin_memory`: True
794
+ - `dataloader_persistent_workers`: False
795
+ - `skip_memory_metrics`: True
796
+ - `use_legacy_prediction_loop`: False
797
+ - `push_to_hub`: False
798
+ - `resume_from_checkpoint`: None
799
+ - `hub_model_id`: None
800
+ - `hub_strategy`: every_save
801
+ - `hub_private_repo`: None
802
+ - `hub_always_push`: False
803
+ - `gradient_checkpointing`: False
804
+ - `gradient_checkpointing_kwargs`: None
805
+ - `include_inputs_for_metrics`: False
806
+ - `include_for_metrics`: []
807
+ - `eval_do_concat_batches`: True
808
+ - `fp16_backend`: auto
809
+ - `push_to_hub_model_id`: None
810
+ - `push_to_hub_organization`: None
811
+ - `mp_parameters`:
812
+ - `auto_find_batch_size`: False
813
+ - `full_determinism`: False
814
+ - `torchdynamo`: None
815
+ - `ray_scope`: last
816
+ - `ddp_timeout`: 1800
817
+ - `torch_compile`: False
818
+ - `torch_compile_backend`: None
819
+ - `torch_compile_mode`: None
820
+ - `dispatch_batches`: None
821
+ - `split_batches`: None
822
+ - `include_tokens_per_second`: False
823
+ - `include_num_input_tokens_seen`: False
824
+ - `neftune_noise_alpha`: None
825
+ - `optim_target_modules`: None
826
+ - `batch_eval_metrics`: False
827
+ - `eval_on_start`: False
828
+ - `use_liger_kernel`: False
829
+ - `eval_use_gather_object`: False
830
+ - `average_tokens_across_devices`: False
831
+ - `prompts`: None
832
+ - `batch_sampler`: no_duplicates
833
+ - `multi_dataset_batch_sampler`: proportional
834
+
835
+ </details>
836
+
837
+ ### Training Logs
838
+ | Epoch | Step | Training Loss | mteb/nfcorpus_cosine_ndcg@10 | mteb/trec-covid_cosine_ndcg@10 | mteb/fiqa_cosine_ndcg@10 | mteb/quora_cosine_ndcg@10 |
839
+ |:------:|:----:|:-------------:|:----------------------------:|:------------------------------:|:------------------------:|:-------------------------:|
840
+ | -1 | -1 | - | 0.0583 | 0.2174 | 0.0237 | 0.6103 |
841
+ | 0.0568 | 10 | 3.443 | - | - | - | - |
842
+ | 0.1136 | 20 | 2.9692 | - | - | - | - |
843
+ | 0.1705 | 30 | 2.1061 | - | - | - | - |
844
+ | 0.2273 | 40 | 1.3012 | 0.0901 | 0.3585 | 0.0642 | 0.7024 |
845
+ | 0.2841 | 50 | 0.9825 | - | - | - | - |
846
+ | 0.3409 | 60 | 0.7112 | - | - | - | - |
847
+ | 0.3977 | 70 | 0.5853 | - | - | - | - |
848
+ | 0.4545 | 80 | 0.5555 | 0.1714 | 0.5160 | 0.1287 | 0.7800 |
849
+ | 0.5114 | 90 | 0.4633 | - | - | - | - |
850
+ | 0.5682 | 100 | 0.4216 | - | - | - | - |
851
+ | 0.625 | 110 | 0.3846 | - | - | - | - |
852
+ | 0.6818 | 120 | 0.4017 | 0.1923 | 0.5446 | 0.1417 | 0.7890 |
853
+ | 0.7386 | 130 | 0.3606 | - | - | - | - |
854
+ | 0.7955 | 140 | 0.3731 | - | - | - | - |
855
+ | 0.8523 | 150 | 0.3451 | - | - | - | - |
856
+ | 0.9091 | 160 | 0.3352 | 0.2017 | 0.5343 | 0.1472 | 0.7951 |
857
+ | 0.9659 | 170 | 0.3364 | - | - | - | - |
858
+ | 1.0227 | 180 | 0.2606 | - | - | - | - |
859
+ | 1.0795 | 190 | 0.2627 | - | - | - | - |
860
+ | 1.1364 | 200 | 0.2641 | 0.2065 | 0.5449 | 0.1499 | 0.7963 |
861
+ | 1.1932 | 210 | 0.2448 | - | - | - | - |
862
+ | 1.25 | 220 | 0.2394 | - | - | - | - |
863
+ | 1.3068 | 230 | 0.2433 | - | - | - | - |
864
+ | 1.3636 | 240 | 0.2236 | 0.2096 | 0.5432 | 0.1519 | 0.7975 |
865
+ | 1.4205 | 250 | 0.221 | - | - | - | - |
866
+ | 1.4773 | 260 | 0.2215 | - | - | - | - |
867
+ | 1.5341 | 270 | 0.2291 | - | - | - | - |
868
+ | 1.5909 | 280 | 0.2433 | 0.2102 | 0.5322 | 0.1543 | 0.7994 |
869
+ | 1.6477 | 290 | 0.219 | - | - | - | - |
870
+ | 1.7045 | 300 | 0.2207 | - | - | - | - |
871
+ | 1.7614 | 310 | 0.2102 | - | - | - | - |
872
+ | 1.8182 | 320 | 0.2138 | 0.2163 | 0.5289 | 0.1553 | 0.8006 |
873
+ | 1.875 | 330 | 0.2076 | - | - | - | - |
874
+ | 1.9318 | 340 | 0.2076 | - | - | - | - |
875
+ | 1.9886 | 350 | 0.2066 | - | - | - | - |
876
+ | 2.0455 | 360 | 0.2046 | 0.2154 | 0.5339 | 0.1558 | 0.8006 |
877
+ | 2.1023 | 370 | 0.1844 | - | - | - | - |
878
+ | 2.1591 | 380 | 0.17 | - | - | - | - |
879
+ | 2.2159 | 390 | 0.1913 | - | - | - | - |
880
+ | 2.2727 | 400 | 0.165 | 0.2165 | 0.5339 | 0.1547 | 0.8014 |
881
+ | 2.3295 | 410 | 0.1878 | - | - | - | - |
882
+ | 2.3864 | 420 | 0.1841 | - | - | - | - |
883
+ | 2.4432 | 430 | 0.1683 | - | - | - | - |
884
+ | 2.5 | 440 | 0.1767 | 0.2178 | 0.5307 | 0.1565 | 0.8014 |
885
+ | 2.5568 | 450 | 0.1627 | - | - | - | - |
886
+ | 2.6136 | 460 | 0.161 | - | - | - | - |
887
+ | 2.6705 | 470 | 0.1717 | - | - | - | - |
888
+ | 2.7273 | 480 | 0.1832 | 0.2169 | 0.5341 | 0.1570 | 0.8012 |
889
+ | 2.7841 | 490 | 0.1673 | - | - | - | - |
890
+ | 2.8409 | 500 | 0.1517 | - | - | - | - |
891
+ | 2.8977 | 510 | 0.1797 | - | - | - | - |
892
+ | 2.9545 | 520 | 0.1862 | 0.2185 | 0.5323 | 0.1575 | 0.8013 |
893
+
894
+
895
+ ### Framework Versions
896
+ - Python: 3.10.12
897
+ - Sentence Transformers: 3.5.0.dev0
898
+ - Transformers: 4.50.0
899
+ - PyTorch: 2.6.0+cu124
900
+ - Accelerate: 1.5.2
901
+ - Datasets: 3.4.1
902
+ - Tokenizers: 0.21.1
903
+
904
+ ## Citation
905
+
906
+ ### BibTeX
907
+
908
+ #### Sentence Transformers
909
+ ```bibtex
910
+ @inproceedings{reimers-2019-sentence-bert,
911
+ title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
912
+ author = "Reimers, Nils and Gurevych, Iryna",
913
+ booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
914
+ month = "11",
915
+ year = "2019",
916
+ publisher = "Association for Computational Linguistics",
917
+ url = "https://arxiv.org/abs/1908.10084",
918
+ }
919
+ ```
920
+
921
+ <!--
922
+ ## Glossary
923
+
924
+ *Clearly define terms in order to be accessible across audiences.*
925
+ -->
926
+
927
+ <!--
928
+ ## Model Card Authors
929
+
930
+ *Lists the people who create the model card, providing recognition and accountability for the detailed work that goes into its construction.*
931
+ -->
932
+
933
+ <!--
934
+ ## Model Card Contact
935
+
936
+ *Provides a way for people who have updates to the Model Card, suggestions, or questions, to contact the Model Card authors.*
937
+ -->
config.json ADDED
@@ -0,0 +1,25 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "architectures": [
3
+ "BertModel"
4
+ ],
5
+ "attention_probs_dropout_prob": 0.1,
6
+ "classifier_dropout": null,
7
+ "gradient_checkpointing": false,
8
+ "hidden_act": "gelu",
9
+ "hidden_dropout_prob": 0.1,
10
+ "hidden_size": 768,
11
+ "initializer_range": 0.02,
12
+ "intermediate_size": 3072,
13
+ "layer_norm_eps": 1e-12,
14
+ "max_position_embeddings": 512,
15
+ "model_type": "bert",
16
+ "num_attention_heads": 12,
17
+ "num_hidden_layers": 12,
18
+ "pad_token_id": 0,
19
+ "position_embedding_type": "absolute",
20
+ "torch_dtype": "float32",
21
+ "transformers_version": "4.50.0",
22
+ "type_vocab_size": 2,
23
+ "use_cache": true,
24
+ "vocab_size": 30522
25
+ }
config_sentence_transformers.json ADDED
@@ -0,0 +1,10 @@
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "__version__": {
3
+ "sentence_transformers": "3.5.0.dev0",
4
+ "transformers": "4.50.0",
5
+ "pytorch": "2.6.0+cu124"
6
+ },
7
+ "prompts": {},
8
+ "default_prompt_name": null,
9
+ "similarity_fn_name": "cosine"
10
+ }
model.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:72248aaad3946371d69688860590c84bd07968867d1ed1669f89cdfb52f382ff
3
+ size 437951328
modules.json ADDED
@@ -0,0 +1,14 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ [
2
+ {
3
+ "idx": 0,
4
+ "name": "0",
5
+ "path": "",
6
+ "type": "sentence_transformers.models.Transformer"
7
+ },
8
+ {
9
+ "idx": 1,
10
+ "name": "1",
11
+ "path": "1_Pooling",
12
+ "type": "sentence_transformers.models.Pooling"
13
+ }
14
+ ]
sentence_bert_config.json ADDED
@@ -0,0 +1,4 @@
 
 
 
 
 
1
+ {
2
+ "max_seq_length": 196,
3
+ "do_lower_case": false
4
+ }
special_tokens_map.json ADDED
@@ -0,0 +1,37 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "cls_token": {
3
+ "content": "[CLS]",
4
+ "lstrip": false,
5
+ "normalized": false,
6
+ "rstrip": false,
7
+ "single_word": false
8
+ },
9
+ "mask_token": {
10
+ "content": "[MASK]",
11
+ "lstrip": false,
12
+ "normalized": false,
13
+ "rstrip": false,
14
+ "single_word": false
15
+ },
16
+ "pad_token": {
17
+ "content": "[PAD]",
18
+ "lstrip": false,
19
+ "normalized": false,
20
+ "rstrip": false,
21
+ "single_word": false
22
+ },
23
+ "sep_token": {
24
+ "content": "[SEP]",
25
+ "lstrip": false,
26
+ "normalized": false,
27
+ "rstrip": false,
28
+ "single_word": false
29
+ },
30
+ "unk_token": {
31
+ "content": "[UNK]",
32
+ "lstrip": false,
33
+ "normalized": false,
34
+ "rstrip": false,
35
+ "single_word": false
36
+ }
37
+ }
tokenizer.json ADDED
The diff for this file is too large to render. See raw diff
 
tokenizer_config.json ADDED
@@ -0,0 +1,63 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "added_tokens_decoder": {
3
+ "0": {
4
+ "content": "[PAD]",
5
+ "lstrip": false,
6
+ "normalized": false,
7
+ "rstrip": false,
8
+ "single_word": false,
9
+ "special": true
10
+ },
11
+ "100": {
12
+ "content": "[UNK]",
13
+ "lstrip": false,
14
+ "normalized": false,
15
+ "rstrip": false,
16
+ "single_word": false,
17
+ "special": true
18
+ },
19
+ "101": {
20
+ "content": "[CLS]",
21
+ "lstrip": false,
22
+ "normalized": false,
23
+ "rstrip": false,
24
+ "single_word": false,
25
+ "special": true
26
+ },
27
+ "102": {
28
+ "content": "[SEP]",
29
+ "lstrip": false,
30
+ "normalized": false,
31
+ "rstrip": false,
32
+ "single_word": false,
33
+ "special": true
34
+ },
35
+ "103": {
36
+ "content": "[MASK]",
37
+ "lstrip": false,
38
+ "normalized": false,
39
+ "rstrip": false,
40
+ "single_word": false,
41
+ "special": true
42
+ }
43
+ },
44
+ "clean_up_tokenization_spaces": false,
45
+ "cls_token": "[CLS]",
46
+ "do_lower_case": true,
47
+ "extra_special_tokens": {},
48
+ "mask_token": "[MASK]",
49
+ "max_length": 196,
50
+ "model_max_length": 196,
51
+ "pad_to_multiple_of": null,
52
+ "pad_token": "[PAD]",
53
+ "pad_token_type_id": 0,
54
+ "padding_side": "right",
55
+ "sep_token": "[SEP]",
56
+ "stride": 0,
57
+ "strip_accents": null,
58
+ "tokenize_chinese_chars": true,
59
+ "tokenizer_class": "BertTokenizer",
60
+ "truncation_side": "right",
61
+ "truncation_strategy": "longest_first",
62
+ "unk_token": "[UNK]"
63
+ }
vocab.txt ADDED
The diff for this file is too large to render. See raw diff