INDEX
Explanations
phrases indicating continuation or persistence
instances of the word "continue" in various contexts
New Auto-Interp
Negative Logits
usterity
-0.70
adish
-0.70
idden
-0.68
aster
-0.67
soc
-0.65
typ
-0.64
ister
-0.64
ummer
-0.64
ranch
-0.64
Sheikh
-0.63
POSITIVE LOGITS
continued
0.82
uninterrupted
0.81
repeating
0.79
unanswered
0.78
unab
0.78
ende
0.76
continues
0.75
reiter
0.74
unchanged
0.74
Continue
0.74
Activations Density 0.026%