INDEX
Explanations
articles and possessive pronouns in the text
New Auto-Interp
Negative Logits
]={↵-0.14
ergus
-0.14
èĩ
-0.14
maj
-0.14
ibo
-0.14
agua
-0.13
ozem
-0.13
teki
-0.13
νι
-0.13
ósito
-0.13
POSITIVE LOGITS
anto
0.16
swire
0.16
ÑĢоÑĤив
0.15
nor
0.14
orex
0.14
uni
0.13
/MIT
0.13
bis
0.13
esa
0.13
Portions
0.13
Activations Density 1.159%