INDEX
    Explanations

    the occurrence of the word "first" and its variations in different contexts

    New Auto-Interp
    Negative Logits
    aul
    -0.17
     for
    -0.15
    aret
    -0.15
    ider
    -0.14
    ook
    -0.14
    žit
    -0.14
    abil
    -0.14
     pul
    -0.14
    essen
    -0.13
    ara
    -0.13
    POSITIVE LOGITS
     times
    0.32
     fois
    0.25
     vez
    0.25
     keer
    0.23
     TIMES
    0.22
     time
    0.21
    times
    0.21
     veces
    0.20
     Times
    0.19
     vezes
    0.19
    Act Density 0.015%

    No Known Activations