INDEX
    Explanations

    occurrences of the word "the."

    New Auto-Interp
    Negative Logits
    ulta
    -0.17
    753
    -0.15
    ortal
    -0.15
    ular
    -0.14
    Vars
    -0.14
    ernen
    -0.14
    OTT
    -0.14
    league
    -0.14
    iku
    -0.14
     vys
    -0.14
    POSITIVE LOGITS
    ÏĦÏĤ
    0.18
    ühr
    0.18
    ÙĨÚ¯ÛĮ
    0.16
    .ide
    0.15
    ìłĢ
    0.15
    ÏĬκ
    0.14
    ixmap
    0.14
    sein
    0.14
    _gpio
    0.14
    ugin
    0.14
    Act Density 0.517%

    No Known Activations