INDEX
Explanations
specific details about film productions and prominent actors involved
New Auto-Interp
Negative Logits
anced
-0.17
Ì£
-0.15
/cgi
-0.15
kle
-0.14
inh
-0.14
Simon
-0.14
Chooser
-0.14
αι
-0.14
strcasecmp
-0.14
roit
-0.14
POSITIVE LOGITS
Heard
0.36
Amber
0.27
Pirates
0.23
Johnny
0.23
amber
0.23
Johnny
0.22
amber
0.21
heard
0.18
Fairfax
0.17
Hearing
0.17
Activations Density 0.004%