INDEX
Explanations
references to the term "Plastics" or related concepts
NEGATIVE LOGITS
yonel      -0.19
738        -0.18
ored       -0.16
obus       -0.16
an         -0.16
466        -0.15
olicit     -0.15
plain      -0.15
anja       -0.15
implify    -0.15
POSITIVE LOGITS
ummer      0.23
atts       0.22
ural       0.21
zens       0.20
inth       0.20
enary      0.20
asm        0.20
umb        0.19
zen        0.19
ottage     0.19
Activations Density 0.012%