INDEX
Explanations
references to familial concepts or relationships
New Auto-Interp
Negative Logits
stav
-0.18
.xhtml
-0.15
yw
-0.15
Mate
-0.15
ĭ
-0.14
yms
-0.14
iffe
-0.14
.beh
-0.14
èģ
-0.14
inia
-0.14
POSITIVE LOGITS
iglia
0.33
ÃŃlia
0.28
iliar
0.24
ili
0.24
ously
0.23
ished
0.22
iger
0.21
OUS
0.21
igli
0.20
ulus
0.20
Activations Density 0.004%