INDEX
Explanations
possessive adjectives and pronouns related to individuals
Negative Logits
owo       -0.09
eld       -0.08
ugs       -0.07
hai       -0.07
olves     -0.07
thiên     -0.07
aidu      -0.06
teşkil    -0.06
uzey      -0.06
ovo       -0.06
Positive Logits
in        0.07
case      0.06
absence   0.06
amet      0.06
cleanup   0.06
وتی       0.06
ammer     0.06
338       0.06
agr       0.06
spare     0.06
Activations Density 0.017%