INDEX
Explanations
proper nouns, specifically the name "Richard"
occurrences of the name "Richard."
New Auto-Interp
Negative Logits
sites
-0.72
è¦ļéĨĴ
-0.67
ramid
-0.66
joint
-0.65
cedented
-0.64
worn
-0.64
icago
-0.63
TOR
-0.62
ipeg
-0.62
£ı
-0.61
POSITIVE LOGITS
Richard
1.12
Richard
0.95
Dawkins
0.89
Pryor
0.85
Allan
0.81
Johnson
0.80
Neville
0.80
Wallace
0.79
Spencer
0.76
wine
0.76
Activations Density 0.010%