INDEX
Explanations
negative sentiment or expressions of discontent
Negative Logits
tinyos            -0.37
fortawesome       -0.36
altezza           -0.34
不说              -0.30
bautizo           -0.29
aDecoder          -0.29
anhydride         -0.29
""],              -0.29
Krise             -0.29
addPreferredGap   -0.29
Positive Logits
uxxxx             0.51
ArgsConstructor   0.50
Personendaten     0.49
ronpa             0.47
.*")]             0.47
RotationOrder     0.46
новниш            0.46
GTCX              0.45
nahilalakip       0.44
gewähr            0.44
Activations Density 0.198%