Explanations
references to a specific brand of motorcycle
references to the brand Harley-Davidson
Negative Logits
Token        Logit
mble         -1.11
xual         -0.87
ozy          -0.86
eering       -0.80
ournal       -0.79
yip          -0.76
urity        -0.73
Languages    -0.72
omsky        -0.71
ocol         -0.71
Positive Logits
Token        Logit
Davidson     0.89
Girls        0.74
Quinn        0.74
girl         0.71
shire        0.70
Harley       0.70
GH           0.68
nuts         0.67
woman        0.67
Hath         0.67
Activation Density: 0.013%