INDEX
Explanations
names of political figures
mentions of the political figure Jeb Bush
New Auto-Interp
Negative Logits
anguage
-0.68
女
-0.68
ized
-0.66
erie
-0.66
istically
-0.65
ATA
-0.64
DRAG
-0.62
ization
-0.62
isations
-0.61
Myst
-0.61
POSITIVE LOGITS
edia
1.12
ruary
0.94
Bush
0.91
keye
0.87
ilitation
0.86
bler
0.83
bie
0.83
bour
0.81
bia
0.81
clinton
0.80
Activations Density 0.023%