INDEX
Explanations
references to the website IMDb
mentions of "IMDB" or related references to a film rating or review context
Negative Logits
velt     -0.78
ieve     -0.76
halla    -0.73
nown     -0.73
ieves    -0.72
rooms    -0.70
yards    -0.66
ingen    -0.65
isson    -0.64
atan     -0.63
Positive Logits
MED      1.17
PLIC     1.12
HO       1.09
PLE      1.08
PLIED    1.04
AX       1.02
MY       1.02
PROV     0.98
ITED     0.97
BO       0.96
Activations Density: 0.018%