INDEX
Explanations
phrases indicating missed opportunities or events
New Auto-Interp
Negative Logits
ysis
-0.74
vec
-0.72
seek
-0.72
rim
-0.72
onse
-0.69
venge
-0.68
dh
-0.66
rouse
-0.65
levels
-0.65
reth
-0.64
POSITIVE LOGITS
pelled
1.02
something
0.75
uled
0.73
poke
0.73
pell
0.73
LAST
0.71
pelling
0.71
noticing
0.70
Notice
0.65
spotting
0.64
Activations Density 0.012%