INDEX
Explanations
references to film analysis and reviews
New Auto-Interp
Negative Logits
EMA
-0.17
Banc
-0.15
anela
-0.15
åIJĪ
-0.14
extr
-0.14
ensing
-0.13
arger
-0.13
pping
-0.13
inho
-0.13
ses
-0.13
POSITIVE LOGITS
felt
0.26
felt
0.21
manages
0.18
succeeds
0.17
manage
0.17
suffer
0.16
feel
0.16
managing
0.16
Manage
0.16
feels
0.16
Activations Density 0.084%