INDEX
    Explanations

    mentions of numbers or measurements

    occurrences of specific punctuation symbols or brackets

    New Auto-Interp
    Negative Logits
    rette
    -0.72
    ĵĺ
    -0.68
    aband
    -0.67
     neigh
    -0.66
    tones
    -0.66
     cradle
    -0.65
    marine
    -0.65
    etsy
    -0.65
    ãĤ©
    -0.64
    reet
    -0.64
    POSITIVE LOGITS
     However
    0.96
     Additionally
    0.91
     Furthermore
    0.85
     Similarly
    0.84
     Conversely
    0.82
     Likewise
    0.82
     Later
    0.81
     Nevertheless
    0.80
     Therefore
    0.77
     Alternatively
    0.76
    Act Density 0.050%

    No Known Activations