INDEX
    Explanations

    references to the word "first" in various contexts

    New Auto-Interp
    Negative Logits
    reszcie
    -0.71
    %)$
    -0.71
    rrggbb
    -0.68
     Winaray
    -0.64
    PyExc
    -0.60
     finally
    -0.60
     lastly
    -0.59
    rawDesc
    -0.59
     exaggeration
    -0.57
    hésite
    -0.56
    POSITIVE LOGITS
     few
    0.83
    born
    0.81
     responders
    0.80
     thing
    0.79
     aider
    0.78
     aid
    0.77
     impression
    0.74
     ever
    0.73
     Aid
    0.73
     glance
    0.73
    Act Density 0.150%

    No Known Activations