INDEX
Explanations
phrases indicating an increase in quantity or availability
New Auto-Interp
Negative Logits
alars
-0.16
#
-0.16
amer
-0.15
ulumi
-0.15
piler
-0.15
senal
-0.15
nants
-0.15
tual
-0.15
-tm
-0.14
¶Į
-0.14
POSITIVE LOGITS
ष
0.18
Patri
0.16
oe
0.15
-than
0.15
than
0.15
Till
0.14
Uri
0.14
Browse
0.14
OT
0.14
Break
0.14
Activations Density 0.015%