INDEX
    Explanations

    near-zero activations indicating lack of significant content or structure in the text.

    New Auto-Interp
    Negative Logits
    OfClass
    -0.06
    Rates
    -0.06
    ?>
    ↵
    ↵
    -0.06
    -0.06
     nüfus
    -0.06
     can
    -0.06
    _UNICODE
    -0.06
    .%
    -0.06
    Thomas
    -0.06
     is
    -0.06
    POSITIVE LOGITS
    ول
    0.07
     biraz
    0.06
    RequestMethod
    0.06
     viewing
    0.06
     составляет
    0.06
    ного
    0.06
    0.06
     bulunan
    0.06
    0.06
    ังจาก
    0.06
    Act Density 0.203%

    No Known Activations