INDEX
Explanations
phrases indicating a complete denial or negation of something
phrases that indicate a lack of beneficial impact or relationship
Negative Logits
Token      Logit
proverb    -0.71
Knot       -0.69
Debor      -0.67
erva       -0.65
olk        -0.64
Paragu     -0.64
KGB        -0.63
emetery    -0.62
oute       -0.61
ŃĶ         -0.61
Positive Logits
Token        Logit
whatsoever   1.22
imaginable   0.87
soever       0.77
resembling   0.75
eter         0.74
resembles    0.74
caster       0.73
isal         0.71
construed    0.70
involved     0.70
Activations Density 0.041%
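
For context on the density figure, here is a minimal sketch of how such a number could be computed. It assumes that "activations density" means the fraction of sampled tokens on which the feature's activation is above zero. The page does not state this definition, and the function name, threshold parameter, and sample data below are hypothetical.

```python
from typing import Sequence


def activation_density(activations: Sequence[float], threshold: float = 0.0) -> float:
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: density = (tokens where the feature fires) / (all sampled tokens).
    This definition is not confirmed by the page itself.
    """
    if len(activations) == 0:
        raise ValueError("activations must not be empty")
    firing = sum(1 for a in activations if a > threshold)
    return firing / len(activations)


# Hypothetical sample: the feature fires on 41 of 100,000 tokens.
sample = [1.0] * 41 + [0.0] * 99_959
density = activation_density(sample)
density_label = f"{density:.3%}"  # formatted as a percentage with three decimals
```

Under this assumed definition, a density of 0.041% would mean the feature fires on roughly 4 of every 10,000 sampled tokens.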