INDEX
Explanations
phrases related to social issues or political topics
New Auto-Interp
Negative Logits
assisted
-0.89
OWS
-0.78
breakers
-0.77
anism
-0.75
enance
-0.74
eyes
-0.73
ares
-0.70
chairs
-0.70
tags
-0.69
rates
-0.68
POSITIVE LOGITS
lot
1.28
tendency
1.22
possibility
1.21
shortage
1.19
plethora
1.16
definite
1.09
tremendous
1.03
chance
1.01
caveat
1.01
discrepancy
0.99
Activations Density 0.091%