INDEX
Explanations
trigger words signaling a transition or change in topic
instances of the word "Before."
Negative Logits
Token         Logit
pers          -0.72
asc           -0.70
perpet        -0.65
fest          -0.60
unspecified   -0.59
trail         -0.59
cent          -0.58
swe           -0.58
Noble         -0.57
reincarn      -0.57
Positive Logits
Token         Logit
Before        3.09
Before        2.10
before        1.90
After         1.78
Prior         1.64
During        1.64
Without       1.64
Until         1.60
Previously    1.52
Since         1.52
Activations Density 0.020%
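
As a rough illustration of what a density figure like the one above can mean, the sketch below assumes that "activation density" is the fraction of token positions on which the feature's activation is above zero. That definition, the function name `activation_density`, and the sample data are all assumptions made for illustration; they are not taken from the dashboard itself.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds `threshold`.

    Assumed definition: density = (tokens where the feature fires) / (all tokens).
    """
    if not activations:
        return 0.0
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Hypothetical data: the feature fires on 2 of 10,000 token positions.
acts = [0.0] * 9998 + [1.5, 3.2]
density = activation_density(acts)
print(f"{density:.3%}")  # prints 0.020%
```

Under this assumption, a density of 0.020% corresponds to the feature firing on about 2 tokens in every 10,000, which fits a feature that responds to a narrow set of words such as "Before".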