INDEX
Explanations
words indicating refusal or negation in decision-making contexts
New Auto-Interp
Negative Logits
pinulongan
-0.60
GenerationType
-0.57
iecie
-0.56
zd
-0.53
soort
-0.53
Drapeau
-0.52
τρο
-0.51
leep
-0.51
focal
-0.50
Pin
-0.50
POSITIVE LOGITS
melakukannya
0.76
devamını
0.65
fernández
0.61
aikaa
0.60
这样做
0.59
partecipare
0.59
solches
0.58
tričko
0.58
eccell
0.58
concernés
0.58
Activations Density 0.421%