INDEX
Explanations
phrases expressing quantity or degree, particularly the use of "so" with varying contexts
New Auto-Interp
Negative Logits
rica
-0.15
bestselling
-0.15
prit
-0.15
orig
-0.14
rof
-0.14
pard
-0.14
ousand
-0.14
-prepend
-0.13
odesk
-0.13
rico
-0.13
POSITIVE LOGITS
oth
0.18
ars
0.18
jom
0.17
isson
0.17
-called
0.16
ething
0.16
ARS
0.15
ovit
0.15
far
0.15
strain
0.15
Activations Density 0.068%