Explanations
references to specific films or projects
Negative Logits
Token       Logit
naments     -0.16
é©          -0.15
layers      -0.15
allet       -0.15
خصÙĪØµ      -0.14
elite       -0.14
ắt          -0.14
oria        -0.14
eler        -0.14
vasive      -0.14
Positive Logits
Token       Logit
til         0.23
mirror      0.23
mount       0.21
Til         0.20
Mount       0.20
body        0.20
APS         0.19
Mirror      0.19
bodies      0.18
mirror      0.18
Activations Density 0.002%