INDEX
Explanations
phrases where the text discusses the continuation or progression of a previous topic or action
phrases that indicate progression or continuation of a topic
New Auto-Interp
Negative Logits
ullah
-0.72
ipation
-0.60
ateurs
-0.59
alis
-0.58
eers
-0.57
screens
-0.56
cape
-0.56
lake
-0.55
schedules
-0.55
icio
-0.54
POSITIVE LOGITS
verning
0.92
ggle
0.88
overboard
0.88
vt
0.85
lems
0.83
along
0.82
forth
0.76
deeper
0.76
ffer
0.75
farther
0.74
Activations Density 0.091%