INDEX
Explanations
references to characters and relationships in stories or films
New Auto-Interp
Negative Logits
éĮĦ
-0.15
iao
-0.15
Produ
-0.15
mimic
-0.15
amework
-0.14
ubbo
-0.14
еж
-0.14
mî
-0.14
EOF
-0.13
ckill
-0.13
POSITIVE LOGITS
played
0.51
Played
0.46
played
0.42
Played
0.41
voiced
0.30
playable
0.25
-play
0.23
(vo
0.22
port
0.22
portrayed
0.22
Activations Density 0.217%