INDEX
Explanations
references to film characters and their roles in narratives
New Auto-Interp
Negative Logits
uter
-0.16
ãĥ¼ãĥį
-0.15
ez
-0.15
cmds
-0.15
evaluation
-0.14
jmp
-0.14
jc
-0.14
hab
-0.14
ced
-0.14
ande
-0.14
POSITIVE LOGITS
Bat
0.35
Gotham
0.31
Bat
0.31
Batman
0.31
BAT
0.30
bat
0.29
Batman
0.28
bat
0.28
BAT
0.28
bats
0.26
Activations Density 0.049%