INDEX
    Explanations

    the presence of the word "to" in various contexts

    New Auto-Interp
    Negative Logits
    εÏį
    -0.16
    -marker
    -0.15
    /people
    -0.14
    ncia
    -0.14
     ntohs
    -0.14
    hire
    -0.14
    dbl
    -0.14
    ocrates
    -0.14
    amo
    -0.14
    ingly
    -0.14
    POSITIVE LOGITS
    ietf
    0.18
    pts
    0.17
    abbo
    0.17
    ajar
    0.15
    ÑĢаÑħ
    0.15
    azure
    0.15
     scre
    0.14
    raf
    0.14
    erras
    0.14
    perl
    0.14
    Act Density 0.088%

    No Known Activations