INDEX
Explanations
mentions of specific names or handles in a conversation or text
references and discussions related to political figures and events
New Auto-Interp
Negative Logits
д
-0.88
cel
-0.85
Ì
-0.83
ÑĤ
-0.82
Sloven
-0.82
Li
-0.82
selfies
-0.80
л
-0.79
Lithuan
-0.78
Unity
-0.78
POSITIVE LOGITS
Bush
2.22
Bush
2.08
bush
1.52
Cheney
1.32
bush
1.31
George
1.27
Jeb
1.23
Rove
1.20
Gore
1.20
Saddam
1.14
Activations Density 0.328%