INDEX
Explanations
phrases related to progress, setbacks, and their respective impacts on motivation and performance
Negative Logits
avení          -0.41
HasAnnotation  -0.41
overjoyed      -0.41
цов            -0.40
transQ         -0.40
glad           -0.39
epik           -0.38
eln            -0.38
Glad           -0.37
rejoiced       -0.37
Positive Logits
negative       0.90
negative       0.87
negatives      0.85
Negative       0.79
Negative       0.79
الحره          0.76
negation       0.74
negatively     0.74
Gegens         0.73
negativos      0.72
Activations Density 0.813%