INDEX
    Explanations

    occurrences of the word "text" in various forms

    New Auto-Interp
    Negative Logits
    warts
    -0.07
    issing
    -0.07
    ibble
    -0.07
    aguay
    -0.06
    ueur
    -0.06
    hed
    -0.06
    abler
    -0.06
    lett
    -0.06
    anko
    -0.06
     dual
    -0.06
    POSITIVE LOGITS
    ual
    0.08
    icular
    0.08
    ually
    0.07
    ظ
    0.07
    ured
    0.07
    e
    0.07
    echn
    0.06
    ston
    0.06
    лÑİд
    0.06
    URED
    0.06
    Act Density 0.015%

    No Known Activations