INDEX
Explanations
phrases related to memories or historical events
words or phrases related to narratives and storytelling
New Auto-Interp
Negative Logits
rab
-0.66
rella
-0.61
Beg
-0.61
Topics
-0.60
inton
-0.58
Lot
-0.58
decency
-0.57
oway
-0.57
zee
-0.56
Isle
-0.56
POSITIVE LOGITS
ynthesis
1.10
ories
1.09
poons
1.07
mith
0.90
hops
0.88
peak
0.86
ynt
0.85
uggest
0.85
pring
0.83
ystem
0.79
Activations Density 0.009%