    Explanations

    phrases indicating speech or communication

    Negative Logits
    Token             Logit
    " dub"            -0.14
    "ALAR"            -0.13
    " çevir"          -0.13
    "isdigit"         -0.12
    "ugi"             -0.12
    "translator"      -0.12
    "edl"             -0.12
    ".VisualBasic"    -0.12
    "zeigen"          -0.12
    "035"             -0.12
    Positive Logits
    Token             Logit
    " word"           1.09
    " words"          1.09
    "words"           0.91
    "word"            0.91
    " WORD"           0.89
    "-word"           0.88
    " Words"          0.85
    " Word"           0.82
    "Word"            0.80
    "_word"           0.78
    Activation Density: 0.317%

    No Known Activations