INDEX
Explanations
phrases related to common storytelling tropes and clichés
terms related to narrative tropes and clichés
New Auto-Interp
Negative Logits
gur
-0.90
hand
-0.72
brew
-0.71
ilated
-0.70
trials
-0.69
Ethiopian
-0.69
gan
-0.68
vals
-0.68
ander
-0.66
imentary
-0.66
POSITIVE LOGITS
pmwiki
1.35
trope
1.26
tropes
1.14
enegger
0.87
clich
0.86
invoked
0.77
witz
0.77
OPLE
0.73
-+-+
0.69
spelled
0.67
Activations Density 0.025%