INDEX
Explanations
references to movie titles and names in various contexts related to film and actors
New Auto-Interp
Negative Logits
idlo
-0.15
bis
-0.14
loh
-0.14
authors
-0.14
bis
-0.14
creativecommons
-0.13
Skywalker
-0.13
mÃŃt
-0.13
æ°ı
-0.13
nas
-0.13
POSITIVE LOGITS
director
0.23
directed
0.21
arring
0.20
II
0.20
dir
0.20
trailer
0.19
dir
0.19
movie
0.19
opposite
0.18
_LOGGER
0.18
Activations Density 0.071%