INDEX
Explanations
references to family, health issues, and financial matters within the text
New Auto-Interp
Negative Logits
hra
-0.15
rei
-0.14
oku
-0.14
代çIJĨ
-0.14
loth
-0.14
Ế
-0.14
atus
-0.13
大åħ¨
-0.13
hangi
-0.13
áº
-0.13
POSITIVE LOGITS
from
0.48
à¸Īาà¸ģà¸ģาร
0.38
from
0.33
från
0.32
từ
0.30
from
0.29
dari
0.29
à¸Īาà¸ģ
0.29
æĿ¥èĩª
0.28
FROM
0.27
Activations Density 0.250%