INDEX
Explanations
expressions of positive feelings and progress in personal training or rehabilitation
positive expressions related to personal well-being and progress
New Auto-Interp
Negative Logits
Holocaust
-0.86
centuries
-0.79
outlawed
-0.77
scorn
-0.76
lest
-0.75
Nobel
-0.74
violate
-0.72
undesirable
-0.72
euphem
-0.72
colonial
-0.72
POSITIVE LOGITS
progressing
0.89
Coach
0.89
iets
0.85
Hopefully
0.85
Been
0.85
hopefully
0.83
Hopefully
0.82
Working
0.80
--+
0.78
everything
0.76
Activations Density 0.658%