INDEX
Explanations
movie titles
titles and key phrases from popular entertainment franchises or series
New Auto-Interp
Negative Logits
imentary
-0.78
uckland
-0.77
prus
-0.77
ickr
-0.75
fed
-0.74
uga
-0.73
dispersed
-0.72
knowledgeable
-0.71
arij
-0.70
retire
-0.69
POSITIVE LOGITS
sequels
1.11
Trilogy
1.02
trilogy
1.01
Animated
0.99
films
0.98
soundtrack
0.97
Seasons
0.95
sequel
0.95
Movie
0.93
Movie
0.92
Activations Density 0.179%