INDEX
Explanations
instances of giving up or despair
New Auto-Interp
Negative Logits
IDEO
-0.15
aras
-0.14
ocrates
-0.14
deo
-0.14
ales
-0.14
alian
-0.14
discomfort
-0.14
loh
-0.14
.ss
-0.13
æīķ
-0.13
POSITIVE LOGITS
despair
0.38
hopeless
0.34
resigned
0.26
resign
0.23
hope
0.21
Hope
0.21
abandon
0.21
hope
0.20
discouraged
0.20
Hope
0.20
Activations Density 0.207%