INDEX
    Explanations

    instances of the word "first" and its variations

    New Auto-Interp
    Negative Logits
     first
    -0.18
    ixa
    -0.17
     further
    -0.17
     firstly
    -0.16
    dal
    -0.15
    rzy
    -0.15
    yssey
    -0.14
    ixo
    -0.14
    ssel
    -0.14
    essler
    -0.14
    POSITIVE LOGITS
    -ever
    0.39
    s
    0.36
    -hand
    0.33
    -rate
    0.32
    born
    0.31
     tiên
    0.30
    -time
    0.29
     responders
    0.27
    -order
    0.27
    -degree
    0.27
    Act Density 0.129%

    No Known Activations