INDEX
Explanations
references to specific brands or types of pens
New Auto-Interp
Negative Logits
oyo
-0.16
iano
-0.16
ево
-0.15
kah
-0.15
cz
-0.15
о
-0.14
-0.14
affer
-0.14
urr
-0.14
erule
-0.14
POSITIVE LOGITS
ultimate
0.30
pen
0.29
etration
0.27
Pen
0.26
pen
0.26
elope
0.25
ning
0.24
PEN
0.23
insula
0.22
Pen
0.22
Activations Density 0.019%