Explanations
mentions of political figures and current events related to government policies
Negative Logits
aloud          -0.70
peak           -0.66
lins           -0.66
hooting        -0.65
LEASE          -0.63
deals          -0.62
accordingly    -0.62
duction        -0.62
uate           -0.62
besides        -0.61
Positive Logits
opportunity    1.21
same           1.15
slightest      1.10
ability        1.08
utmost         1.07
highest        1.04
requisite      1.01
lowest         0.99
widest         0.99
greatest       0.99
Activations Density 0.086%