INDEX
    Explanations

    phrases and variations of the word "get."

    Negative Logits
    Token            Logit
    " Commons"       -0.18
    "ADX"            -0.15
    "oola"           -0.15
    "oded"           -0.14
    "iná"            -0.14
    "ollapsed"       -0.14
    "imeo"           -0.14
    "انگ"            -0.14
    "execution"      -0.13
    "notated"        -0.13

    Positive Logits
    Token            Logit
    "-to"             0.25
    " lost"           0.20
    " introduced"     0.20
    "tok"             0.18
    " tog"            0.17
    " reint"          0.17
    " know"           0.17
    "ultan"           0.16
    " re"             0.16
    " acquainted"     0.16
    Activation Density: 0.040%

    No Known Activations