Explanations
words related to trailers or promotional material for films or shows
references to trailers or previews of films and games
Negative Logits
Admin        -0.82
¬¼           -0.79
mathemat     -0.72
urrency      -0.67
subsistence  -0.65
ente         -0.65
nesota       -0.64
orem         -0.64
verning      -0.64
edIn         -0.64
Positive Logits
trailer      1.39
trailers     1.22
teaser       1.21
tease        1.12
Trailer      1.05
showcasing   1.00
footage      0.99
screenshots  0.98
teased       0.97
IMAGES       0.95
Activations Density: 0.133%