INDEX
Explanations
proper nouns related to locations or people
common prefixes and suffixes in words
Negative Logits
ĩ           -0.69
ī           -0.69
·           -0.68
Citiz       -0.67
«           -0.67
orial       -0.65
İ           -0.65
ĺ           -0.65
OME         -0.64
lvl         -0.64
Positive Logits
infring       0.56
spokeswoman   0.56
èĢħ           0.53
accent        0.52
.—            0.52
.(            0.52
contam        0.51
spokesman     0.51
arrives       0.50
!,            0.49
Activations Density 2.068%
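
A density figure like the one above is usually read as the share of sampled token positions on which the feature fires. The sketch below shows that calculation under stated assumptions: the function name `activation_density`, the `threshold` parameter, and the sample values are all hypothetical, and the page itself does not say how its 2.068% was computed.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" means the share of positions with a strictly
    positive activation. The source page does not define the term.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 100 token positions, 3 of which activate the feature.
sample = [0.0] * 97 + [0.4, 1.2, 0.7]
density = activation_density(sample)
print(f"Activations Density {density:.3%}")
```

Raising `threshold` above zero would count only stronger activations, which lowers the reported density.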