INDEX
Explanations
proper nouns, specifically names of individuals or entities
New Auto-Interp
Negative Logits
ãĤ¼ãĤ¦ãĤ¹
-0.91
ãĥ´ãĤ¡
-0.73
ãĤ¦ãĤ¹
-0.66
partName
-0.64
issance
-0.63
shown
-0.63
ा
-0.62
metic
-0.62
Janeiro
-0.61
INA
-0.60
POSITIVE LOGITS
zinski
0.70
acket
0.61
pson
0.61
Samurai
0.59
bley
0.59
iott
0.59
otos
0.58
bye
0.58
kes
0.58
oller
0.57
Activations Density 0.032%