Explanations
mentions of specific individuals, such as political figures
names and titles of prominent individuals, particularly in political contexts
Negative Logits
ĸļ          -0.73
BMC         -0.69
pta         -0.68
gui         -0.63
Crim        -0.60
Citiz       -0.60
odder       -0.60
[|          -0.59
ipeg        -0.58
ancial      -0.57

Positive Logits
Corker      0.92
fman        0.78
socket      0.66
Tillerson   0.65
Cosponsors  0.65
Lago        0.65
boycot      0.64
uranium     0.64
Flake       0.63
Schumer     0.63

Activations Density 0.837%