INDEX
Explanations
interactions and relationships among characters in a narrative
Negative Logits
strup    -0.17
aines    -0.15
dana     -0.14
alian    -0.14
реж      -0.14
uilder   -0.13
udev     -0.13
fashion  -0.13
rido     -0.13
tuy      -0.13
Positive Logits
并      0.25
，並    0.23
并      0.23
，并    0.23
)&&     0.21
이고    0.21
แล      0.20
然后    0.20
並      0.18
있고    0.18
Activations Density 0.534%