INDEX
Explanations
people's names
references to a specific individual named Harris
New Auto-Interp
Negative Logits
rious
-0.78
nces
-0.73
unal
-0.68
UE
-0.68
ition
-0.66
erous
-0.65
yip
-0.64
culosis
-0.63
uesday
-0.62
uers
-0.62
POSITIVE LOGITS
burg
1.19
mann
0.98
abis
0.83
kins
0.81
ippi
0.79
alez
0.78
aday
0.77
bury
0.77
iday
0.76
ONEY
0.76
Activations Density 0.012%