INDEX
Explanations
names of individuals, with emphasis on those individuals' contributions or connections to cultural and historical contexts
Negative Logits
ά         -0.20
ipc       -0.17
hq        -0.16
ÑĨÑĮ      -0.15
oly       -0.15
aps       -0.15
alah      -0.15
jen       -0.15
jet       -0.14
arshal    -0.14
Positive Logits
shima     0.31
uchi      0.28
awa       0.27
ushima    0.27
aura      0.27
amoto     0.27
ishi      0.26
ura       0.25
asaki     0.25
ami       0.25
Activations Density 0.048%