INDEX
Explanations
terms and phrases that express strong emotions or reactions, particularly negative ones
Negative Logits
Token           Logit
Masyarakat      -0.57
conformément    -0.54
Erbe            -0.51
Kecil           -0.49
GEBURTSDATUM    -0.48
nahilalakip     -0.47
DockStyle       -0.47
Geduld          -0.47
aanwezig        -0.46
Calidad         -0.46
Positive Logits
Token           Logit
squ             0.62
sp              0.60
monkey          0.57
fla             0.56
fl              0.56
mis             0.56
sk              0.55
Squ             0.54
bl              0.54
Sprintf         0.53
Activations Density: 0.060%