INDEX
Explanations
references to films and filmmaking
New Auto-Interp
Negative Logits
trône
-0.76
étrang
-0.73
Vrij
-0.70
Toler
-0.69
Rache
-0.69
ANTAGE
-0.69
}".
-0.68
Vrij
-0.68
ırlı
-0.66
]").
-0.65
POSITIVE LOGITS
films
1.51
film
1.47
FILM
1.40
Film
1.36
film
1.26
Films
1.25
Film
1.18
FILMS
1.18
FILM
1.16
Films
1.16
Activations Density 0.037%