INDEX
Explanations
references to movie roles and performances
New Auto-Interp
Negative Logits
anim
-0.15
ستگÛĮ
-0.15
å¸Ń
-0.15
ê¶Į
-0.15
ariat
-0.15
èŃľ
-0.15
æĨ¶
-0.15
Bonus
-0.15
azon
-0.15
bris
-0.14
POSITIVE LOGITS
opposite
0.68
alongside
0.36
playing
0.35
Opp
0.35
playing
0.33
oppos
0.31
beside
0.31
Opp
0.30
пÑĢоÑĤивоп
0.29
Playing
0.28
Activations Density 0.034%