INDEX
Explanations
words related to negative or problematic situations, often involving mistakes or failures
negative expressions related to distress and adverse experiences
New Auto-Interp
Negative Logits
izes
-0.89
ively
-0.80
acion
-0.73
eway
-0.73
ership
-0.71
abet
-0.71
inen
-0.70
ivity
-0.70
iveness
-0.69
iola
-0.69
POSITIVE LOGITS
nesday
0.93
uled
0.90
ĸļ
0.80
awake
0.78
away
0.77
out
0.76
GGGGGGGG
0.75
onto
0.75
aback
0.70
CLASSIFIED
0.69
Activations Density 0.174%