INDEX
Explanations
looking in a direction or at someone
New Auto-Interp
Negative Logits
ྭ
0.96
procura
0.88
dupa
0.84
کاهش
0.84
ያስፈል
0.83
creș
0.83
ጷ
0.82
noting
0.81
絊
0.81
يجب
0.81
POSITIVE LOGITS
izer
0.76
good
0.63
sky
0.61
Good
0.59
good
0.59
man
0.58
lack
0.56
具
0.56
mixte
0.56
Overall
0.56
Activations Density 0.027%