    Explanations

    occurrences of the word "first"

    Negative Logits
    " informa"          -0.33
    " annan"            -0.31
    " déf"              -0.30
    " moelle"           -0.30
    " AspNetCore"       -0.29
    " obviamente"       -0.29
    " evidentemente"    -0.28
    " distrik"          -0.28
    " carnes"           -0.28
    " matéri"           -0.28

    Positive Logits
    " først"            0.62
    "новь"              0.59
    " originally"       0.59
    " الرياضيه"         0.58
    " initially"        0.58
    "SequentialGroup"   0.57
    "最初に"            0.57
    " eerst"            0.57
    " didst"            0.57
    " Initially"        0.56

    Activation Density: 0.019%

    No Known Activations