INDEX
Explanations
phrases related to film production and performances
New Auto-Interp
Negative Logits
assa
-0.17
ihar
-0.15
ili
-0.15
zl
-0.14
oleon
-0.14
мил
-0.14
bir
-0.13
alam
-0.13
cer
-0.13
chen
-0.13
POSITIVE LOGITS
originally
0.18
psc
0.16
finder
0.15
Originally
0.15
ymb
0.15
é«
0.15
mpp
0.15
mtx
0.15
Originally
0.14
0.14
Activations Density 0.128%