INDEX
Explanations
references to a person named "Butler"
mentions of a specific individual, particularly "Butler."
Negative Logits
Token         Logit
UAL           -0.76
arin          -0.72
clud          -0.71
ning          -0.71
reek          -0.69
ammed         -0.69
European      -0.69
nings         -0.69
particular    -0.68
popular       -0.68
Positive Logits
Token         Logit
Butler        1.25
Osw           0.78
illard        0.77
McH           0.74
irez          0.73
Hancock       0.73
Maker         0.72
okia          0.70
ipop          0.69
Bulldogs      0.68
Activations Density: 0.007%