INDEX
Explanations
negative sentiments or unfavorable assessments
New Auto-Interp
Negative Logits
majánló
-0.84
indígen
-0.76
queſta
-0.70
desmotivaciones
-0.69
MainAxisSize
-0.68
שוליים
-0.67
increí
-0.65
ðsíða
-0.65
httphttps
-0.65
feroit
-0.65
POSITIVE LOGITS
-
0.73
–
0.60
−
0.51
-\
0.50
‐
0.49
‒
0.48
'-
0.47
-
0.47
―
0.44
-,
0.43
Activations Density 0.060%