INDEX
Explanations
instances where a character takes an action that leads to a specific outcome
the repeated phrase "only" followed by varying negative or adverse circumstances
Negative Logits
cv      -0.64
iller   -0.62
insula  -0.61
ahead   -0.61
rect    -0.60
ahime   -0.59
lass    -0.59
ji      -0.58
CV      -0.58
atical  -0.57
Positive Logits
kidding       0.93
marginally    0.89
minutes       0.79
weeks         0.76
seconds       0.76
incidentally  0.75
spor          0.73
realizing     0.73
moments       0.73
intermitt     0.72
Activations Density: 0.049%