INDEX
Explanations
statements or actions related to public figures or political events
Negative Logits
Gems        -0.66
Mile        -0.64
classic     -0.64
services    -0.64
pregn       -0.63
Veter       -0.62
Voltage     -0.62
Claire      -0.62
Mandatory   -0.60
Impact      -0.58
Positive Logits
zbollah     1.38
'll         1.30
'd          1.29
resy        1.22
eded        1.11
uristic     1.04
aven        1.03
ather       1.02
ctic        1.02
eding       1.01
Activations Density: 2.402%