INDEX
Explanations
politicians, political figures, and related terms
mentions of prominent political figures and entities
New Auto-Interp
Negative Logits
ģĸ
-0.69
ãĥ¯
-0.64
Oracle
-0.63
thens
-0.63
tml
-0.62
pires
-0.61
Mehran
-0.60
Berry
-0.59
asury
-0.57
uers
-0.56
POSITIVE LOGITS
being
0.94
's
0.93
behaving
0.86
dying
0.86
needing
0.85
involvement
0.84
disappearing
0.83
losing
0.82
versus
0.82
influencing
0.80
Activations Density 0.794%