INDEX
Explanations
proper nouns, particularly related to locations and names
Negative Logits
erts         -0.16
ÙĦÙĪØ¯       -0.16
atur         -0.15
eree         -0.15
ister        -0.14
ether        -0.14
imal         -0.14
informant    -0.14
es           -0.14
fund         -0.14
Positive Logits
shire        0.30
NodeType     0.18
arian        0.18
aser         0.17
ilian        0.16
bury         0.16
asers        0.16
vale         0.16
isas         0.15
eshire       0.15
Activations Density 0.060%