INDEX
    Explanations

    comparative phrases and expressions of quantity

    New Auto-Interp
    Negative Logits
    \<^
    -0.16
    edBy
    -0.15
    ioxide
    -0.15
    wy
    -0.15
    geb
    -0.15
    .way
    -0.14
    onz
    -0.14
    scri
    -0.14
    ologically
    -0.13
    cairo
    -0.13
    POSITIVE LOGITS
     many
    0.17
    veral
    0.15
    ivity
    0.15
     fewer
    0.14
    zell
    0.14
    Ľå»º
    0.14
    еÑĤÑĮ
    0.14
    ous
    0.14
    insk
    0.14
     dozens
    0.14
    Act Density 0.189%

    No Known Activations