Explanations
references to historical and cultural elements in cinema
Negative Logits
Token     Logit
was       -0.30
Was       -0.25
wasn      -0.25
isn       -0.24
was       -0.24
Was       -0.24
is        -0.23
_was      -0.22
станет    -0.21
isnt      -0.19
Positive Logits
Token     Logit
are       0.70
were      0.68
weren     0.55
são       0.55
Were      0.54
sont      0.53
aren      0.52
were      0.52
waren     0.52
jsou      0.50
Activations Density: 0.106%