INDEX
Explanations
proper nouns related to people or places, specifically 'Mills'
references to individuals or entities associated with "Mills" and related terms
New Auto-Interp
Negative Logits
liest
-0.73
ctory
-0.72
Falk
-0.71
esses
-0.67
stern
-0.66
gers
-0.65
ctica
-0.64
LAPD
-0.64
Saddam
-0.63
fierce
-0.63
POSITIVE LOGITS
pora
1.07
Mills
0.99
essage
0.97
creen
0.86
onduct
0.84
hirt
0.84
boro
0.80
mallow
0.80
arty
0.79
onductor
0.78
Activations Density 0.016%