INDEX
Explanations
proper nouns
specific names and terms related to a particular individual or organization
Negative Logits
Murdoch    -0.74
Osw        -0.70
OPLE       -0.70
Pas        -0.69
phrine     -0.67
PORT       -0.67
Strauss    -0.65
cean       -0.65
UNCH       -0.64
chefs      -0.63
Positive Logits
rha        0.83
onial      0.83
aah        0.81
thens      0.81
ials       0.79
sburgh     0.77
obo        0.76
rador      0.75
iations    0.74
onite      0.74
Activations Density: 0.026%