Add new SentenceTransformer model.
Browse files- README.md +116 -140
- model.safetensors +1 -1
README.md
CHANGED
@@ -12,108 +12,108 @@ tags:
|
|
12 |
- dataset_size:30288
|
13 |
- loss:MultipleNegativesRankingLoss
|
14 |
widget:
|
15 |
-
- source_sentence: '
|
|
|
16 |
sentences:
|
17 |
-
-
|
18 |
-
|
19 |
-
|
20 |
-
|
21 |
-
- '
|
22 |
-
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
23 |
sentences:
|
24 |
-
- '
|
25 |
-
|
26 |
-
|
27 |
-
|
28 |
-
|
29 |
-
|
30 |
-
|
31 |
-
|
32 |
-
|
33 |
-
|
34 |
-
|
35 |
-
|
36 |
-
|
37 |
-
|
38 |
-
|
39 |
-
|
40 |
-
|
41 |
-
|
42 |
-
|
43 |
-
|
44 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
45 |
sentences:
|
46 |
-
-
|
47 |
-
|
48 |
-
|
49 |
-
|
50 |
-
|
51 |
-
|
52 |
-
|
53 |
-
|
54 |
-
|
55 |
-
experience with me? Thanks, stay high and mellow my opioid lovers Side note: if
|
56 |
-
you havent tried plugging I recommend it 100%, no shame, just safe dosing.'
|
57 |
-
- source_sentence: 'what can i do to be a likeable person? what do people look for
|
58 |
-
in friends? what determines our worth as a person? I realized that a lot of my
|
59 |
-
problems come from trying to impress people in order for them to like me and possibly
|
60 |
-
become friends.
|
61 |
-
|
62 |
-
|
63 |
-
|
64 |
-
|
65 |
-
but do people really look at all the things you''ve accomplished and all the things
|
66 |
-
you''ve done to determine if your worthy of being a friend? apparently that seems
|
67 |
-
to be my mindset, and that''s the reason I do things just to impress people
|
68 |
-
|
69 |
-
|
70 |
-
|
71 |
-
|
72 |
-
so what do people look for in others when determining whether they can be a good
|
73 |
-
friend or not. or another way to think of it, what determines our worth as a person? '
|
74 |
sentences:
|
75 |
-
- '
|
76 |
-
-
|
77 |
-
- '
|
78 |
-
|
79 |
-
trying to impress people in order for them to like me and possibly become friends.
|
80 |
-
|
81 |
-
|
82 |
-
|
83 |
-
|
84 |
-
but do people really look at all the things you''ve accomplished and all the things
|
85 |
-
you''ve done to determine if your worthy of being a friend? apparently that seems
|
86 |
-
to be my mindset, and that''s the reason I do things just to impress people
|
87 |
-
|
88 |
-
|
89 |
-
|
90 |
-
|
91 |
-
so what do people look for in others when determining whether they can be a good
|
92 |
-
friend or not. or another way to think of it, what determines our worth as a person? '
|
93 |
-
- source_sentence: 'Goodnight, Texas - The Horse Accident (In Which A Girl Was All
|
94 |
-
But Killed) '
|
95 |
sentences:
|
96 |
-
-
|
97 |
-
|
98 |
-
|
99 |
-
|
100 |
-
|
101 |
-
|
102 |
-
|
103 |
-
|
104 |
-
|
105 |
-
|
106 |
-
|
107 |
-
|
108 |
-
|
109 |
-
|
110 |
-
a
|
111 |
-
|
112 |
-
|
113 |
-
|
114 |
-
|
115 |
-
|
116 |
-
one subnet. You can select only one subnet per Availability Zone. "'
|
117 |
---
|
118 |
|
119 |
# SentenceTransformer based on manuel-couto-pintos/roberta_erisk
|
@@ -165,9 +165,9 @@ from sentence_transformers import SentenceTransformer
|
|
165 |
model = SentenceTransformer("manuel-couto-pintos/roberta_erisk_simcse")
|
166 |
# Run inference
|
167 |
sentences = [
|
168 |
-
'
|
169 |
-
'
|
170 |
-
|
171 |
]
|
172 |
embeddings = model.encode(sentences)
|
173 |
print(embeddings.shape)
|
@@ -228,13 +228,13 @@ You can finetune this model on your own dataset.
|
|
228 |
| | sentence_0 | sentence_1 |
|
229 |
|:--------|:-----------------------------------------------------------------------------------|:-----------------------------------------------------------------------------------|
|
230 |
| type | string | string |
|
231 |
-
| details | <ul><li>min: 9 tokens</li><li>mean:
|
232 |
* Samples:
|
233 |
-
| sentence_0
|
234 |
-
|
235 |
-
| <code>
|
236 |
-
| <code>
|
237 |
-
| <code>
|
238 |
* Loss: [<code>MultipleNegativesRankingLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#multiplenegativesrankingloss) with these parameters:
|
239 |
```json
|
240 |
{
|
@@ -248,7 +248,7 @@ You can finetune this model on your own dataset.
|
|
248 |
|
249 |
- `per_device_train_batch_size`: 10
|
250 |
- `per_device_eval_batch_size`: 10
|
251 |
-
- `num_train_epochs`:
|
252 |
- `multi_dataset_batch_sampler`: round_robin
|
253 |
|
254 |
#### All Hyperparameters
|
@@ -271,7 +271,7 @@ You can finetune this model on your own dataset.
|
|
271 |
- `adam_beta2`: 0.999
|
272 |
- `adam_epsilon`: 1e-08
|
273 |
- `max_grad_norm`: 1
|
274 |
-
- `num_train_epochs`:
|
275 |
- `max_steps`: -1
|
276 |
- `lr_scheduler_type`: linear
|
277 |
- `lr_scheduler_kwargs`: {}
|
@@ -369,38 +369,14 @@ You can finetune this model on your own dataset.
|
|
369 |
</details>
|
370 |
|
371 |
### Training Logs
|
372 |
-
| Epoch | Step
|
373 |
-
|
374 |
-
| 0.1651 | 500
|
375 |
-
| 0.3301 | 1000
|
376 |
-
| 0.4952 | 1500
|
377 |
-
| 0.6603 | 2000
|
378 |
-
| 0.8254 | 2500
|
379 |
-
| 0.9904 | 3000
|
380 |
-
| 1.1555 | 3500 | 0.0 |
|
381 |
-
| 1.3206 | 4000 | 0.0 |
|
382 |
-
| 1.4856 | 4500 | 0.0002 |
|
383 |
-
| 1.6507 | 5000 | 0.0003 |
|
384 |
-
| 1.8158 | 5500 | 0.0003 |
|
385 |
-
| 1.9809 | 6000 | 0.0 |
|
386 |
-
| 2.1459 | 6500 | 0.0 |
|
387 |
-
| 2.3110 | 7000 | 0.0 |
|
388 |
-
| 2.4761 | 7500 | 0.0 |
|
389 |
-
| 2.6411 | 8000 | 0.0003 |
|
390 |
-
| 2.8062 | 8500 | 0.0003 |
|
391 |
-
| 2.9713 | 9000 | 0.0 |
|
392 |
-
| 3.1363 | 9500 | 0.0 |
|
393 |
-
| 3.3014 | 10000 | 0.0 |
|
394 |
-
| 3.4665 | 10500 | 0.0002 |
|
395 |
-
| 3.6316 | 11000 | 0.0003 |
|
396 |
-
| 3.7966 | 11500 | 0.0003 |
|
397 |
-
| 3.9617 | 12000 | 0.0 |
|
398 |
-
| 4.1268 | 12500 | 0.0 |
|
399 |
-
| 4.2918 | 13000 | 0.0 |
|
400 |
-
| 4.4569 | 13500 | 0.0 |
|
401 |
-
| 4.6220 | 14000 | 0.0003 |
|
402 |
-
| 4.7871 | 14500 | 0.0003 |
|
403 |
-
| 4.9521 | 15000 | 0.0 |
|
404 |
|
405 |
|
406 |
### Framework Versions
|
|
|
12 |
- dataset_size:30288
|
13 |
- loss:MultipleNegativesRankingLoss
|
14 |
widget:
|
15 |
+
- source_sentence: 'Looks like a small cockroach, but much more colorful, 0.75" long.
|
16 |
+
[Atlanta, Georgia] '
|
17 |
sentences:
|
18 |
+
- 'Help me win a bet: What size gi does Marcelo Garcia wear? I suspect he uses different
|
19 |
+
size pants relative to the gi-top because of his epic thighs relative to stature.
|
20 |
+
My buddy just says A2 all around (on average, recognizing that it varies by brand).
|
21 |
+
What do you say? '
|
22 |
+
- 'What little things about the Star Wars Universe do you love? '
|
23 |
+
- 'Looks like a small cockroach, but much more colorful, 0.75" long. [Atlanta, Georgia] '
|
24 |
+
- source_sentence: "Clogged Construction on my brand new condo finished this summer.\
|
25 |
+
\ Not wasting a second, I broke lease on my musky apartment, and moved in as soon\
|
26 |
+
\ as possible. I rather enjoyed knowing I was the first resident living here:\
|
27 |
+
\ there was no wear and tear, no smoke stains on the walls, and no damage to the\
|
28 |
+
\ structure. The only issue was a light clattering sound whenever I used the commercial\
|
29 |
+
\ sink in my laundry room. I rarely used it, so I didn't bring up the problem\
|
30 |
+
\ to the contractors. Everything else worked perfectly, and my home was as sterile\
|
31 |
+
\ as an operating table.\n\n\n\n nbsp;\n\n\n\nAfter a few months, I began noticing\
|
32 |
+
\ water pooling at the foot of my shower. The drain must have been clogged. I\
|
33 |
+
\ took to my tools, unscrewed the shower drain, and peered inside. I could see\
|
34 |
+
\ a collection of fibers bunched up in the pipes. Reaching in with an unfolded\
|
35 |
+
\ coat hanger, I pulled out mountains of dirty blond hair clogging the pipes.\
|
36 |
+
\ I live alone, I don't have any pets, I haven't entertained a lady in over a\
|
37 |
+
\ year, and I've been bald since I was 27.\n\n\n\n nbsp;\n\n\n\nThe odd phenomena\
|
38 |
+
\ got me thinking about the sink in the laundry room. I detached the aerator,\
|
39 |
+
\ placed my hand under the faucet, and turned on the water. Dozens of molars came\
|
40 |
+
\ flying out, slipping through my fingers and into the sink, bouncing up and down\
|
41 |
+
\ until ultimately falling down the drain.\n\n\n\n nbsp;\n\n\n\nOn a completely\
|
42 |
+
\ unrelated note: I have a beautiful, fully furnished, barely-used condo for sale.\
|
43 |
+
\ Located in downtown Detroit. Anyone interested? "
|
44 |
sentences:
|
45 |
+
- '3-2 defense cannot stop corner 3s? Does anyone else have this problem? My down
|
46 |
+
low guys won''t kick out to even try to defend an open 3 shot, and the computer
|
47 |
+
just spams this on me all day when I play offline. '
|
48 |
+
- tw.being suicidal but knowing someone whos commit is the worst thing in the world.
|
49 |
+
bc you see both sides. you see how it affects the people that love that person.
|
50 |
+
including yourself. you see how it doesnt end the pain but it just passes it on
|
51 |
+
to all the people who are left to deal with it. but then it also makes it so much
|
52 |
+
more understandable as to why someone did it. you know what its like to want the
|
53 |
+
pain to end. the feeling of your brain sabotaging you and your happiness constantly.
|
54 |
+
to stop feeling like youre drowning in yourself. you get each and every point
|
55 |
+
to it. and in a sense it makes me feel even more guilty for ever having the thought
|
56 |
+
in the first place. for it becoming my safe space. knowing that if things dont
|
57 |
+
fall into place that im okay with not being here anymore but not being okay leaving
|
58 |
+
the people you love to clean up the mess / carry it with them for the rest of
|
59 |
+
their lives. sorry. end rant.
|
60 |
+
- "Clogged Construction on my brand new condo finished this summer. Not wasting\
|
61 |
+
\ a second, I broke lease on my musky apartment, and moved in as soon as possible.\
|
62 |
+
\ I rather enjoyed knowing I was the first resident living here: there was no\
|
63 |
+
\ wear and tear, no smoke stains on the walls, and no damage to the structure.\
|
64 |
+
\ The only issue was a light clattering sound whenever I used the commercial sink\
|
65 |
+
\ in my laundry room. I rarely used it, so I didn't bring up the problem to the\
|
66 |
+
\ contractors. Everything else worked perfectly, and my home was as sterile as\
|
67 |
+
\ an operating table.\n\n\n\n nbsp;\n\n\n\nAfter a few months, I began noticing\
|
68 |
+
\ water pooling at the foot of my shower. The drain must have been clogged. I\
|
69 |
+
\ took to my tools, unscrewed the shower drain, and peered inside. I could see\
|
70 |
+
\ a collection of fibers bunched up in the pipes. Reaching in with an unfolded\
|
71 |
+
\ coat hanger, I pulled out mountains of dirty blond hair clogging the pipes.\
|
72 |
+
\ I live alone, I don't have any pets, I haven't entertained a lady in over a\
|
73 |
+
\ year, and I've been bald since I was 27.\n\n\n\n nbsp;\n\n\n\nThe odd phenomena\
|
74 |
+
\ got me thinking about the sink in the laundry room. I detached the aerator,\
|
75 |
+
\ placed my hand under the faucet, and turned on the water. Dozens of molars came\
|
76 |
+
\ flying out, slipping through my fingers and into the sink, bouncing up and down\
|
77 |
+
\ until ultimately falling down the drain.\n\n\n\n nbsp;\n\n\n\nOn a completely\
|
78 |
+
\ unrelated note: I have a beautiful, fully furnished, barely-used condo for sale.\
|
79 |
+
\ Located in downtown Detroit. Anyone interested? "
|
80 |
+
- source_sentence: 'Top 10 Movies Trailers of 2017 Must watch It '
|
81 |
sentences:
|
82 |
+
- Im on coke n 2 mg kpin and im anxious as fuckIdk what i can do to get rid of this
|
83 |
+
i know coke doesnt last long but the anxietys lingering n the kpins are keeping
|
84 |
+
me borderline okay, but I've never been this anxious on coke i feel like im on
|
85 |
+
a psychedelic having a bad trip but im not tripping its just the anxiety. Can
|
86 |
+
anyone help me thru this
|
87 |
+
- '[Giveaway] 10 BTS for new users '
|
88 |
+
- 'Top 10 Movies Trailers of 2017 Must watch It '
|
89 |
+
- source_sentence: 'Vet says he nearly operated on himself when VA wouldn''t pay medical
|
90 |
+
bill. '
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
91 |
sentences:
|
92 |
+
- 'What kind of soap is best to get glitter off your skin? '
|
93 |
+
- 'Alvvays is nearly done tracking their next album '
|
94 |
+
- 'Vet says he nearly operated on himself when VA wouldn''t pay medical bill. '
|
95 |
+
- source_sentence: Age old questions[View Poll](https://www.reddit.com/poll/m89hf3)
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
96 |
sentences:
|
97 |
+
- "GUYS I MIGHT HAVE TO DELETE THIS ACCOUNT BECAUSE MY BF KNOWS MY ACC BUT I DON'T\
|
98 |
+
\ WANT TO IT'S A MASSIVE URGENCE I'VE HAD THIS 3 YEARS So basically me and my\
|
99 |
+
\ boyfriend was messing around but he decided to go onto my reddit app and he\
|
100 |
+
\ \"accidently\" saw my reddit account name and he said that he's not going to\
|
101 |
+
\ look cause he knows he won't like what he sees but GUYS my post history is fucked\
|
102 |
+
\ i'm fucked it makes me look more fucked then I am what the fuck do i dooooo\
|
103 |
+
\ D:\n\nI don't wanna start over and there's a couple of subreddits that are suscriber\
|
104 |
+
\ only so how the fuck am i gonna get back \n\nhe's said he's been curious about\
|
105 |
+
\ this before but he knows the sorta stuff i post and he said it would really\
|
106 |
+
\ upset him but when he's curios he usally won't stop wondering but I like to\
|
107 |
+
\ think that i can trust him but I''m complety FUCKED. \n\napparently he forgot\
|
108 |
+
\ it too but he has good memory "
|
109 |
+
- Age old questions[View Poll](https://www.reddit.com/poll/m89hf3)
|
110 |
+
- 'Who else is in a opposite gender dominated industry? What have been your experiences?
|
111 |
+
I am a female in IT. I chose this field because I enjoy it, and it turns out I
|
112 |
+
am good at it. I am not concerned about the gender bias because I feel my qualifications
|
113 |
+
and experience speak for themselves, and so far that has been the case (the only
|
114 |
+
time I have been discriminated against it has not affected my career progress).
|
115 |
+
However, I''m relatively inexperienced and I would love to know other people''s
|
116 |
+
experiences in similar environments. '
|
|
|
117 |
---
|
118 |
|
119 |
# SentenceTransformer based on manuel-couto-pintos/roberta_erisk
|
|
|
165 |
model = SentenceTransformer("manuel-couto-pintos/roberta_erisk_simcse")
|
166 |
# Run inference
|
167 |
sentences = [
|
168 |
+
'Age old questions[View Poll](https://www.reddit.com/poll/m89hf3)',
|
169 |
+
'Age old questions[View Poll](https://www.reddit.com/poll/m89hf3)',
|
170 |
+
"Who else is in a opposite gender dominated industry? What have been your experiences? I am a female in IT. I chose this field because I enjoy it, and it turns out I am good at it. I am not concerned about the gender bias because I feel my qualifications and experience speak for themselves, and so far that has been the case (the only time I have been discriminated against it has not affected my career progress). However, I'm relatively inexperienced and I would love to know other people's experiences in similar environments. ",
|
171 |
]
|
172 |
embeddings = model.encode(sentences)
|
173 |
print(embeddings.shape)
|
|
|
228 |
| | sentence_0 | sentence_1 |
|
229 |
|:--------|:-----------------------------------------------------------------------------------|:-----------------------------------------------------------------------------------|
|
230 |
| type | string | string |
|
231 |
+
| details | <ul><li>min: 9 tokens</li><li>mean: 84.36 tokens</li><li>max: 512 tokens</li></ul> | <ul><li>min: 9 tokens</li><li>mean: 84.36 tokens</li><li>max: 512 tokens</li></ul> |
|
232 |
* Samples:
|
233 |
+
| sentence_0 | sentence_1 |
|
234 |
+
|:-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
|
235 |
+
| <code>Actor Cory Monteith, Who Played Finn Hudson On 'Glee,' Found Dead </code> | <code>Actor Cory Monteith, Who Played Finn Hudson On 'Glee,' Found Dead </code> |
|
236 |
+
| <code>Is the AW3420DW worth double the cost of a $500 monitor?I've been researching ultrawides and wanted to know people's opinion if the extra cost for the [Alienware AW3420DW ($999)](https://www.microcenter.com/product/620684/dell-alienware-aw3420dw-34-wqhd-120hz-hdmi-dp-g-sync--curved-ips-led-gaming-monitor) was worth the extra over say a [AOC CU34G2X ($449)](https://www.microcenter.com/product/618536/aoc-cu34g2x-34-qhd-144hz-hdmi-dp-freesync-ultrawide-curved-led-gaming-monitor) or [BenQ EX3501R ($649)](https://www.bhphotovideo.com/c/product/1383775-REG/benq_ex3501r_premium_grey_35_va_3440x1440.html) or another monitor in that range? If I'm willing to spend the cash for the Alienware, should I just make the leap?</code> | <code>Is the AW3420DW worth double the cost of a $500 monitor?I've been researching ultrawides and wanted to know people's opinion if the extra cost for the [Alienware AW3420DW ($999)](https://www.microcenter.com/product/620684/dell-alienware-aw3420dw-34-wqhd-120hz-hdmi-dp-g-sync--curved-ips-led-gaming-monitor) was worth the extra over say a [AOC CU34G2X ($449)](https://www.microcenter.com/product/618536/aoc-cu34g2x-34-qhd-144hz-hdmi-dp-freesync-ultrawide-curved-led-gaming-monitor) or [BenQ EX3501R ($649)](https://www.bhphotovideo.com/c/product/1383775-REG/benq_ex3501r_premium_grey_35_va_3440x1440.html) or another monitor in that range? If I'm willing to spend the cash for the Alienware, should I just make the leap?</code> |
|
237 |
+
| <code>My first time making it to a week! Awesome! Nothing to say, just felt like sharing(: Have a good day!<br><br><br><br>**EDIT:** Oh my gosh, I meant to say month... Woops. </code> | <code>My first time making it to a week! Awesome! Nothing to say, just felt like sharing(: Have a good day!<br><br><br><br>**EDIT:** Oh my gosh, I meant to say month... Woops. </code> |
|
238 |
* Loss: [<code>MultipleNegativesRankingLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#multiplenegativesrankingloss) with these parameters:
|
239 |
```json
|
240 |
{
|
|
|
248 |
|
249 |
- `per_device_train_batch_size`: 10
|
250 |
- `per_device_eval_batch_size`: 10
|
251 |
+
- `num_train_epochs`: 1
|
252 |
- `multi_dataset_batch_sampler`: round_robin
|
253 |
|
254 |
#### All Hyperparameters
|
|
|
271 |
- `adam_beta2`: 0.999
|
272 |
- `adam_epsilon`: 1e-08
|
273 |
- `max_grad_norm`: 1
|
274 |
+
- `num_train_epochs`: 1
|
275 |
- `max_steps`: -1
|
276 |
- `lr_scheduler_type`: linear
|
277 |
- `lr_scheduler_kwargs`: {}
|
|
|
369 |
</details>
|
370 |
|
371 |
### Training Logs
|
372 |
+
| Epoch | Step | Training Loss |
|
373 |
+
|:------:|:----:|:-------------:|
|
374 |
+
| 0.1651 | 500 | 0.8614 |
|
375 |
+
| 0.3301 | 1000 | 0.0012 |
|
376 |
+
| 0.4952 | 1500 | 0.0007 |
|
377 |
+
| 0.6603 | 2000 | 0.0002 |
|
378 |
+
| 0.8254 | 2500 | 0.0002 |
|
379 |
+
| 0.9904 | 3000 | 0.0 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
380 |
|
381 |
|
382 |
### Framework Versions
|
model.safetensors
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 498604904
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:ab31625c6baf4eef65e6ba1581f7f3d70e7cff6a7095ed1f58f4bbe4f8cfa81a
|
3 |
size 498604904
|