INDEX
Explanations
phrases indicating additional information or examples
phrases that include the term "not to mention."
Negative Logits
rend     -0.84
sis      -0.76
fell     -0.75
odes     -0.75
olid     -0.75
adden    -0.74
rete     -0.74
rame     -0.72
hor      -0.71
oward    -0.71
Positive Logits
secondly        0.78
blah            0.68
allergies       0.68
condoms         0.67
plenty          0.66
beware          0.66
additionally    0.65
importantly     0.65
lots            0.64
imagine         0.63
Activations Density 0.100%