INDEX
    Explanations

    occurrences of the word "the."

    New Auto-Interp
    Negative Logits
    olle
    -0.20
    avel
    -0.14
    شد
    -0.14
    ud
    -0.14
    nun
    -0.14
    .dtd
    -0.14
    .intellij
    -0.13
    رÙĪØ¯
    -0.13
    .TabIndex
    -0.13
    otte
    -0.13
    POSITIVE LOGITS
     Bilg
    0.15
    ÙİØ§ÙĨ
    0.15
    icens
    0.14
    lah
    0.14
    .shiro
    0.13
     âĢķ
    0.13
     Brian
    0.13
    ~
    0.13
     acomp
    0.13
    afari
    0.13
    Act Density 0.023%

    No Known Activations