INDEX
Explanations
phrases indicating pathways or methods to achieve goals
New Auto-Interp
Negative Logits
VERSE
-0.16
_fault
-0.16
quist
-0.14
zek
-0.14
_ABORT
-0.13
eed
-0.13
ikt
-0.13
verse
-0.13
_reporting
-0.13
upbringing
-0.13
POSITIVE LOGITS
success
0.38
glory
0.35
victory
0.33
greatness
0.33
fame
0.30
success
0.30
Victory
0.26
Success
0.26
adulthood
0.25
Success
0.25
Activations Density 0.360%