INDEX
Explanations
criticisms of character development and dialogue quality in films
Negative Logits
doz       -0.17
ãĥ³ãĤ¸    -0.15
itecture  -0.15
arith     -0.15
rett      -0.14
imuth     -0.14
Bout      -0.14
Sugar     -0.14
vej       -0.13
å¯        -0.13
Positive Logits
yd        0.15
bat       0.15
rel       0.14
çļ®       0.14
aug       0.14
ysi       0.14
acle      0.14
+=↵       0.14
Hoffman   0.14
.AF       0.14
Activations Density: 0.174%