INDEX
Explanations
references to various genres and categories of Roman films
New Auto-Interp
Negative Logits
bic
-0.15
tings
-0.15
ories
-0.14
arp
-0.14
imposing
-0.14
zb
-0.14
udies
-0.14
icio
-0.14
:animated
-0.13
BCM
-0.13
POSITIVE LOGITS
aldo
0.16
è±
0.15
oldt
0.15
º
0.14
ctal
0.14
adena
0.14
Newly
0.14
оÑģÑĥд
0.14
обов
0.14
ffa
0.13
Activations Density 0.031%