INDEX
    Explanations

    instances of the word 'first' followed by a number

    occurrences of the word "first."

    New Auto-Interp
    Negative Logits
    orate
    -0.80
    eez
    -0.63
    hin
    -0.61
    endif
    -0.59
    soType
    -0.58
    illes
    -0.58
     impunity
    -0.57
    SPONSORED
    -0.56
    å§«
    -0.55
    rah
    -0.55
    POSITIVE LOGITS
     first
    3.13
    first
    2.51
     FIRST
    2.13
    First
    2.01
     First
    1.82
     second
    1.68
     earliest
    1.66
     initial
    1.55
     fourth
    1.39
     third
    1.36
    Act Density 0.095%

    No Known Activations