INDEX
Explanations
phrases or sentences that transition to a new topic or idea
phrases that suggest transitions or conclusions in discussions
New Auto-Interp
Negative Logits
iership
-0.65
agos
-0.64
ricular
-0.64
destro
-0.63
withdrew
-0.60
financed
-0.59
roman
-0.59
tolerated
-0.58
cour
-0.58
vanquished
-0.57
POSITIVE LOGITS
me
0.89
curious
0.80
undrum
0.78
unsur
0.77
intriguing
0.77
doub
0.76
fascinating
0.76
begs
0.76
question
0.74
obvious
0.74
Activations Density 0.138%