INDEX
Explanations
references to the Republican strategist Karl Rove
mentions of political figures, particularly Karl Rove
New Auto-Interp
Negative Logits
Constructed
-0.81
Mehran
-0.69
ELF
-0.68
WARE
-0.67
hops
-0.66
malink
-0.65
board
-0.65
Qiao
-0.63
++++++++
-0.62
Examiner
-0.62
POSITIVE LOGITS
Rove
0.99
wolves
0.83
hardt
0.80
rils
0.80
reens
0.78
olf
0.76
encer
0.74
cot
0.74
ril
0.74
ttes
0.74
Activations Density 0.023%