INDEX
Explanations
themes within various contexts, such as literature, music, and movies
references to recurring themes in various contexts
Negative Logits
effective   -0.72
carry       -0.67
nder        -0.64
downed      -0.64
undo        -0.64
bow         -0.63
bil         -0.63
raction     -0.62
mit         -0.61
tesy        -0.61
Positive Logits
themes      3.89
theme       2.64
Theme       1.99
theme       1.95
Theme       1.94
motif       1.91
tropes      1.71
topics      1.70
themed      1.44
storylines  1.43
Activations Density: 0.017%