INDEX
    Explanations

    references to the English language

    New Auto-Interp
    Negative Logits
    ThroughAttribute
    -1.02
     kasarigan
    -0.95
    __":
    
    -0.94
    __":
    -0.94
    parsedMessage
    -0.91
    #+#
    -0.88
     conftest
    -0.85
     propOrder
    -0.84
    Tembelea
    -0.84
    клопе
    -0.82
    POSITIVE LOGITS
     English
    2.45
    English
    2.12
     english
    1.87
     ENGLISH
    1.79
    english
    1.73
    ENGLISH
    1.52
     Spanish
    1.45
     French
    1.26
    Spanish
    1.21
     Englisch
    1.12
    Act Density 0.084%

    No Known Activations