INDEX
Explanations
negative sentiment or reluctance
New Auto-Interp
Negative Logits
в
0.93
if
0.93
unico
0.92
old
0.89
error
0.85
potp
0.84
$('#0.84
個性
0.83
oor
0.82
نقص
0.81
POSITIVE LOGITS
жела
0.99
unpleasant
0.99
ualaikum
0.96
mutta
0.95
खासा
0.92
unsurprisingly
0.91
habitual
0.90
inclin
0.90
particularmente
0.90
inclinations
0.90
Activations Density 0.068%