Explanations
phrases expressing possession or belonging
Negative Logits
Token     Logit
stag      -0.15
imar      -0.15
ilib      -0.15
ikan      -0.15
ille      -0.15
Nag       -0.14
imm       -0.14
Äįen      -0.14
ulo       -0.14
Ùĩ        -0.14
Positive Logits
Token     Logit
Resolved  0.16
ohana     0.15
ylon      0.15
nosis     0.15
why       0.15
ufen      0.14
arella    0.14
ãĥ¼ãĥĦ    0.14
rible     0.14
麼        0.14
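Some token strings in the logit lists (for example `Äįen`, `Ùĩ`, and `ãĥ¼ãĥĦ`) look like the printable stand-ins that byte-level BPE vocabularies use for raw UTF-8 bytes. The sketch below assumes the GPT-2 style byte-to-character table. Nothing on this page confirms which tokenizer is in use, and the helper names `bytes_to_unicode` and `decode_token` are illustrative only. Tokens that contain characters outside the table are returned unchanged.

```python
def bytes_to_unicode():
    """Build the GPT-2 style byte -> printable character table (assumed tokenizer)."""
    # Bytes that are already printable map to themselves.
    bs = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(0xA1, 0xAC + 1))
        + list(range(0xAE, 0xFF + 1))
    )
    cs = bs[:]
    n = 0
    # Every remaining byte is shifted to a code point at 256 and above.
    for b in range(256):
        if b not in bs:
            bs.append(b)
            cs.append(256 + n)
            n += 1
    return {b: chr(c) for b, c in zip(bs, cs)}


# Inverse table: printable character -> original byte value.
UNICODE_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def decode_token(token):
    """Map a vocabulary string back to text, or return it unchanged if it
    contains characters that are not part of the byte-level table."""
    try:
        raw = bytes(UNICODE_TO_BYTE[ch] for ch in token)
    except KeyError:
        return token
    # Partial multi-byte sequences are replaced rather than raising.
    return raw.decode("utf-8", errors="replace")


if __name__ == "__main__":
    for tok in ["Äįen", "Ùĩ", "ãĥ¼ãĥĦ", "why", "麼"]:
        print(repr(tok), "->", repr(decode_token(tok)))
```

Plain ASCII tokens such as `why` pass through unchanged, so the helper can be applied to every row of both lists.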
Activations Density 0.066%
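As a rough illustration of what a density figure like this usually means, the sketch below computes the fraction of tokens on which a feature's activation exceeds a threshold. The definition is an assumption, since the page does not state how its number was computed. The function name `activation_density`, the zero threshold, and the sample data are hypothetical.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of token positions whose activation is above `threshold`.

    Assumed definition: the page does not say how its density is computed.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


if __name__ == "__main__":
    # Hypothetical sample: 66 active tokens out of 100,000.
    sample = [0.0] * 99_934 + [1.0] * 66
    print(f"{activation_density(sample):.3%}")
```

Under this reading, a density of 0.066% would correspond to the feature firing on about 66 tokens in every 100,000.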