INDEX
Explanations
references to IM, likely referring to the website IMDb
references to the Internet Movie Database (IMDb)
New Auto-Interp
Negative Logits
nown
-0.73
quez
-0.69
tenance
-0.69
velt
-0.68
Borough
-0.67
ieve
-0.67
halla
-0.67
ieves
-0.66
rooms
-0.66
¶ħ
-0.66
POSITIVE LOGITS
MED
1.14
HO
1.13
PLIC
1.11
Db
1.05
MAC
0.97
PROV
0.97
PLE
0.94
PU
0.94
BO
0.93
AX
0.92
Activations Density 0.029%