INDEX
Explanations
elements related to character relationships and dynamics in narratives
Negative Logits
Token         Logit
hsi           -0.16
arel          -0.15
æĹıèĩªæ²»     -0.15
_ALIGN        -0.15
aina          -0.14
-yyyy         -0.14
rray          -0.14
å¡ļ           -0.14
HeaderCode    -0.14
ä¸ĭåİ»        -0.14
Positive Logits
Token         Logit
(“            0.15
ızı           0.15
Roth          0.14
whom          0.13
e             0.13
itle          0.13
Bishop        0.13
Harris        0.13
whose         0.13
Kirk          0.13
Activations Density 0.152%