INDEX
Explanations
significant emotional or impactful themes related to storytelling and character experiences
New Auto-Interp
Negative Logits
ieten
-0.18
irsch
-0.18
icken
-0.16
pedia
-0.15
_TA
-0.15
iaux
-0.15
iete
-0.15
ýt
-0.15
ainer
-0.14
edin
-0.14
POSITIVE LOGITS
P
0.17
659
0.17
ac
0.16
atik
0.15
NS
0.15
帰
0.15
rio
0.15
NC
0.14
Scale
0.14
bob
0.14
Activations Density 0.051%