INDEX
Explanations
words related to film and cinema terminology
New Auto-Interp
Negative Logits
405
-0.15
inger
-0.15
hal
-0.15
ange
-0.15
pert
-0.15
hiba
-0.15
ails
-0.14
empl
-0.14
halten
-0.14
oom
-0.14
POSITIVE LOGITS
loss
0.17
rift
0.16
oko
0.16
ikan
0.15
radan
0.15
weigh
0.15
rien
0.15
èĪĮ
0.15
Dirt
0.14
fer
0.14
Activations Density 0.020%