Explanations
information related to legal cases and interventions
sentences that relate to political or social statements and events
Negative Logits
democrat       -0.86
kefeller       -0.75
tremend        -0.70
disson         -0.68
dimensional    -0.66
scouting       -0.65
insur          -0.65
exciting       -0.65
unequ          -0.65
imaginable     -0.65
Positive Logits
Asked          1.08
Others         1.07
Later          1.01
However        1.01
Earlier        1.01
Initially      1.01
Previously     1.00
Officials      1.00
Instead        0.99
Though         0.96
Activations Density: 0.624%