INDEX
Explanations
proper nouns, particularly names and locations
New Auto-Interp
Negative Logits
antro
-0.18
osci
-0.16
jerne
-0.16
ë©´
-0.16
"';
-0.15
alloca
-0.15
Midi
-0.15
Pied
-0.15
arov
-0.15
infeld
-0.14
POSITIVE LOGITS
Newfoundland
0.34
Labrador
0.28
Aval
0.26
NL
0.23
St
0.23
foundland
0.23
NFL
0.22
cod
0.20
rador
0.20
NL
0.20
Activations Density 0.028%