INDEX
Explanations
phrases indicating a critical or evaluative perspective on films
New Auto-Interp
Negative Logits
sát
-0.15
erset
-0.15
asco
-0.14
_rq
-0.14
Seriously
-0.14
omor
-0.14
LAP
-0.14
ovu
-0.14
æĬľ
-0.14
retim
-0.13
POSITIVE LOGITS
decent
0.25
OK
0.17
nicely
0.17
ok
0.16
okay
0.16
iler
0.16
nic
0.15
redeem
0.15
nice
0.15
basic
0.15
Activations Density 0.290%