INDEX
Explanations
elements of criticism focusing on character development and emotional engagement in films
New Auto-Interp
Negative Logits
somewhat
-0.16
zn
-0.15
iros
-0.15
slightly
-0.15
naz
-0.14
emey
-0.14
ÅĻen
-0.14
ières
-0.14
.Normalize
-0.14
omo
-0.13
POSITIVE LOGITS
arden
0.16
instead
0.16
ness
0.16
bot
0.14
pad
0.14
Instead
0.14
bor
0.14
NOP
0.14
Attempts
0.14
Baghd
0.13
Activations Density 0.080%