INDEX
Explanations
references to specific films or cinema styles
New Auto-Interp
Negative Logits
ness
-0.32
so
-0.32
ìĿĦ
-0.27
ship
-0.27
ne
-0.27
nya
-0.27
ri
-0.27
land
-0.26
self
-0.24
set
-0.23
POSITIVE LOGITS
urope
0.21
ighborhood
0.16
lected
0.16
ourcem
0.16
iros
0.16
ighbors
0.15
ems
0.15
vents
0.15
pond
0.15
iro
0.15
Activations Density 0.556%