INDEX
Explanations
negative qualifiers or negations related to political or economic contexts
Negative Logits
urm         -0.15
uchs        -0.14
aji         -0.14
olie        -0.14
.workflow   -0.14
nowhere     -0.13
487         -0.13
ocs         -0.13
kinson      -0.13
ical        -0.13
Positive Logits
merely      0.17
ektor       0.16
iot         0.15
eyi         0.15
ymmetric    0.14
mere        0.14
OMIT        0.14
BCM         0.14
_via        0.14
mere        0.14
Activations Density 0.083%