INDEX
Explanations
phrases related to characters or names from various fictional stories
character names in a narrative context
New Auto-Interp
Negative Logits
ancial
-0.84
olutions
-0.79
iners
-0.77
atories
-0.75
Companies
-0.75
Questions
-0.74
estyles
-0.73
oms
-0.73
politics
-0.73
odes
-0.72
POSITIVE LOGITS
whom
1.23
who
1.22
whose
1.10
who
1.06
aka
0.98
whose
0.96
clad
0.95
dressed
0.87
Duchess
0.86
blinded
0.86
Activations Density 0.226%