INDEX
Explanations
proper nouns, specifically names of people and places
references to specific names or identities associated with individuals and groups
Negative Logits
Token      Logit
Finch      -0.65
Welsh      -0.58
ãĤ¢ãĥ«     -0.58
ãĤ½        -0.58
ãĥł        -0.57
mental     -0.56
Eleanor    -0.56
Beaver     -0.55
Cath       -0.55
Ecc        -0.54
Positive Logits
Token      Logit
obin       1.10
chev       0.92
enei       0.90
arov       0.87
afi        0.84
src        0.82
ali        0.78
ijah       0.77
¦          0.77
nir        0.76
Activations Density 0.059%
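
Some tokens in the logit lists (for example ãĤ¢ãĥ«) look like raw byte-level BPE vocabulary strings rather than readable text. The sketch below shows one way to decode them, assuming the vocabulary uses the GPT-2 style byte-to-character mapping; the page does not say which tokenizer produced these tokens, so that is an assumption, and the helper names `byte_level_table` and `decode_token` are hypothetical.

```python
def byte_level_table():
    """Build the inverse of the GPT-2 style byte-to-character mapping.

    Assumption: the tokens come from a byte-level BPE vocabulary that
    follows this convention. Printable bytes map to themselves; every
    other byte is shifted to code points starting at 256.
    """
    keep = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(0xA1, 0xAC + 1))
        + list(range(0xAE, 0xFF + 1))
    )
    table = {}
    shifted = 0
    for b in range(256):
        if b in keep:
            table[chr(b)] = b
        else:
            table[chr(256 + shifted)] = b
            shifted += 1
    return table


def decode_token(token):
    """Turn a vocabulary string back into text (hypothetical helper).

    Tokens that hold only part of a multi-byte character cannot be
    decoded on their own and come back as the replacement character.
    """
    table = byte_level_table()
    raw = bytes(table[ch] for ch in token)
    return raw.decode("utf-8", errors="replace")


if __name__ == "__main__":
    for tok in ["Finch", "ãĤ¢ãĥ«", "ãĤ½", "ãĥł", "¦"]:
        print(repr(tok), "->", repr(decode_token(tok)))
```

Under that assumption, the three garbled-looking entries in the negative list decode to Japanese katakana, while a token such as ¦ is a lone byte that only forms a character together with its neighbours.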