INDEX
Explanations
because, inability, or reason
Negative Logits
المستخدم    0.43
utilisées    0.42
продукт    0.42
them    0.42
orientated    0.41
placés    0.41
ጩ    0.41
المطل    0.41
తులను    0.41
allons    0.40
Positive Logits
incapacity    0.53
inability    0.52
incapac    0.51
بسبب    0.50
paraly    0.49
stuck    0.47
Stuck    0.47
powodu    0.46
paralysis    0.44
तबीयत    0.44
Activations Density: 0.001%