INDEX
Explanations
negative qualifiers in the context of opinions or statements
Negative Logits
berdayakan    -0.70
DockStyle     -0.64
klingt        -0.62
tâche         -0.60
vocale        -0.60
antaranya     -0.60
itſelf        -0.59
IGraphics     -0.59
Theolog       -0.58
vallée        -0.57
Positive Logits
nor           0.69
stanovnika    0.68
Nor           0.62
وتسجيلات      0.61
Nor           0.60
ρυ            0.60
mtrl          0.57
Бахар         0.56
ever          0.55
any           0.54
Activations Density 0.185%
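
The page does not say how the density figure is computed. A common reading, assumed here, is that it is the fraction of token positions on which the feature activates at all. The sketch below illustrates that reading only; the function name `activation_density`, the `threshold` parameter, and the sample data are hypothetical and not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of positions with a nonzero
    (above-threshold) activation. The dashboard may define it differently.
    """
    if not activations:
        raise ValueError("need at least one activation value")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 100,000 token positions, 185 of which are active.
sample = [0.0] * 99_815 + [1.5] * 185
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.185% would mean the feature fires on roughly 1 in every 540 token positions.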