INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     save
    -0.06
    ATHER
    -0.06
    [date
    -0.06
     explosion
    -0.06
    oting
    -0.06
    uit
    -0.06
    ραση
    -0.06
    ávka
    -0.06
    arian
    -0.06
    iale
    -0.06
    POSITIVE LOGITS
     Articles
    0.07
    _SCL
    0.07
    0.07
     yüzyıl
    0.07
     circa
    0.06
     wifi
    0.06
     Constit
    0.06
    _SECTION
    0.06
     Lud
    0.06
     stretches
    0.06
    Act Density 0.002%

    No Known Activations