Explanations
references to George W. Bush
Negative Logits
Shreve      -0.82
nguyễn      -0.78
ransition   -0.77
atedral     -0.75
kler        -0.74
McKinley    -0.74
avits       -0.73
ligators    -0.72
Chall       -0.72
Jakub       -0.71
Positive Logits
George      2.21
George      1.99
george      1.80
GEORGE      1.76
george      1.70
GEORGE      1.59
Georges     1.58
Georges     1.29
Geo         1.20
Georgie     1.18
Activations Density: 0.016%