INDEX
Explanations
expressions of decision-making and resolutions
New Auto-Interp
Negative Logits
ipa
-0.15
ichel
-0.14
inati
-0.14
Äįer
-0.14
ван
-0.14
hoa
-0.13
aná
-0.13
iev
-0.13
larım
-0.13
ateg
-0.13
POSITIVE LOGITS
against
0.17
instead
0.16
Against
0.16
лÑĥÑĩ
0.16
instead
0.16
against
0.16
rather
0.16
Against
0.16
Instead
0.15
skoro
0.15
Activations Density 0.026%