Explanations
film-related terms and variations of the word "film."
Negative Logits
pal       -0.16
pants     -0.16
762       -0.15
illin     -0.15
offs      -0.15
izer      -0.15
uala      -0.14
Clifford  -0.14
ZN        -0.14
FTA       -0.14
Positive Logits
ippo      0.30
ipp       0.29
Fil       0.23
aments    0.22
ософ      0.22
thy       0.22
leted     0.21
fil       0.21
ament     0.20
fila      0.20
Activations Density 0.009%