INDEX
Explanations
negative sentiments or outcomes
New Auto-Interp
Negative Logits
bArr
-0.74
WireFormatLite
-0.65
Conroy
-0.63
rozum
-0.58
helfen
-0.58
ITUTION
-0.56
ึ้น
-0.56
şk
-0.55
blij
-0.54
idopsis
-0.53
POSITIVE LOGITS
negative
2.65
Negative
2.42
negative
2.38
Negative
2.28
NEGATIVE
2.17
negatives
2.16
NEGATIVE
2.10
negativity
1.96
negativo
1.93
négatif
1.87
Activations Density 0.085%