INDEX
Explanations
elements related to film criticism and artistic evaluation
NEGATIVE LOGITS
azor     -0.17
igh      -0.17
vey      -0.16
ãģ°      -0.16
icut     -0.14
Heller   -0.14
ظ        -0.14
Ders     -0.14
vy       -0.14
olic     -0.14
POSITIVE LOGITS
ern      0.32
tern     0.28
ERN      0.28
fern     0.26
bern     0.24
TERN     0.24
enden    0.21
erne     0.20
ndern    0.20
ende     0.20
Activations Density 0.017%