INDEX
Explanations
phrases expressing hope or desire for positive outcomes or experiences
New Auto-Interp
Negative Logits
Geological
-0.71
cles
-0.68
FML
-0.66
uary
-0.64
formerly
-0.64
VIEW
-0.64
arget
-0.64
force
-0.63
Link
-0.62
worth
-0.62
POSITIVE LOGITS
miracles
0.93
perfection
0.85
someday
0.82
imminent
0.81
speedy
0.81
consistency
0.80
success
0.79
survival
0.78
future
0.78
salvation
0.78
Activations Density 0.031%