INDEX
Explanations
personal names with specific patterns common in South Asian cultures
proper nouns, particularly names of people and places
New Auto-Interp
Negative Logits
overwhelming
-0.69
selective
-0.66
mbuds
-0.64
recursive
-0.63
bearings
-0.63
ambassadors
-0.60
cryst
-0.60
magnification
-0.60
idiots
-0.59
bedrock
-0.59
POSITIVE LOGITS
oglu
1.17
ño
1.06
tsky
1.05
owicz
1.03
udeau
1.02
ova
1.01
ovic
0.99
eva
0.98
ovich
0.97
gui
0.96
Activations Density 0.375%