INDEX
Explanations
proper nouns, particularly likely names of entities or places
instances of the word "An" followed by various nouns or phrases
Negative Logits
Token       Logit
ðŁij        -0.73
Bulg        -0.71
è¯          -0.69
poke        -0.69
ï¸ı         -0.67
çĶ          -0.66
Magikarp    -0.64
;)          -0.64
PJ          -0.63
Vaugh       -0.62
Positive Logits
Token       Logit
esthetic    1.11
onym        1.11
alyses      1.08
alogue      1.08
omal        1.07
alys        1.06
omaly       1.04
ubis        0.97
alyst       0.93
notation    0.92
Activations Density 0.074%
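
Several entries in the negative-logit list (ðŁij, è¯, ï¸ı, çĶ) look like partial UTF-8 sequences rendered through a byte-level BPE vocabulary rather than corrupted text. The sketch below assumes the GPT-2-style byte-to-character table, which this page does not confirm, and maps such a token string back to its raw bytes. The helper names `bytes_to_unicode` and `token_to_bytes` are illustrative, and the tokens are written as escape sequences so the code does not depend on how this page was copied.

```python
def bytes_to_unicode():
    """Byte -> printable-character table (assumed: GPT-2-style byte-level BPE)."""
    # Bytes that are already printable map to themselves.
    bs = (list(range(ord("!"), ord("~") + 1))
          + list(range(0xA1, 0xAC + 1))
          + list(range(0xAE, 0xFF + 1)))
    cs = bs[:]
    n = 0
    # Every remaining byte is shifted to a code point at 256 and above.
    for b in range(256):
        if b not in bs:
            bs.append(b)
            cs.append(256 + n)
            n += 1
    return dict(zip(bs, map(chr, cs)))


# Inverse table: displayed character -> original byte value.
UNICODE_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def token_to_bytes(token: str) -> bytes:
    """Recover the raw bytes behind a byte-level token string."""
    return bytes(UNICODE_TO_BYTE[ch] for ch in token)


# Assumed code points for four tokens shown in the negative-logit list.
tokens = [
    "\u00f0\u0141\u0133",  # displayed as ð Ł ĳ
    "\u00e8\u00af",        # displayed as è ¯
    "\u00ef\u00b8\u0131",  # displayed as ï ¸ ı
    "\u00e7\u0136",        # displayed as ç Ķ
]

for tok in tokens:
    raw = token_to_bytes(tok)
    # Incomplete sequences decode to the replacement character.
    print(raw.hex(" "), repr(raw.decode("utf-8", errors="replace")))
```

Under that assumption, ï¸ı corresponds to the bytes EF B8 8F, which is the UTF-8 encoding of U+FE0F (variation selector-16). The other three are incomplete prefixes of multi-byte characters, so they cannot be rendered as standalone text.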