INDEX
Explanations
phrases indicating stories or narratives
instances of the word "tale" and variations thereof
New Auto-Interp
Negative Logits
erate
-0.96
ividual
-0.80
bledon
-0.78
atever
-0.74
ournal
-0.73
ulty
-0.73
apsed
-0.73
eeks
-0.73
ussy
-0.73
uters
-0.72
POSITIVE LOGITS
tales
1.27
tale
1.23
tale
1.05
Tale
1.01
Tales
1.00
Reincarn
0.91
tell
0.86
telling
0.77
Narr
0.75
crow
0.74
Activations Density 0.027%