INDEX
    Explanations

    references to the English language

    New Auto-Interp
    Negative Logits
    ({_
    -0.48
    featureID
    -0.47
    deutig
    -0.46
    pulseira
    -0.46
     cytometry
    -0.45
    )"),
    -0.45
     kapturem
    -0.43
     vítima
    -0.43
     Coyle
    -0.42
    ceptor
    -0.41
    POSITIVE LOGITS
     English
    2.09
    English
    2.02
     english
    1.73
    english
    1.70
     ENGLISH
    1.68
    ENGLISH
    1.48
     Englisch
    1.22
    Engl
    1.20
     Engl
    1.20
     inglés
    1.18
    Act Density 0.010%

    No Known Activations