INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     мәкал
    -0.75
    ✨:
    -0.68
    kháu
    -0.68
    下载附件
    -0.68
     Akismet
    -0.68
     <<<<<<<<<<<<<<
    -0.68
    exus
    -0.67
     مشين
    -0.67
     Audiodateien
    -0.66
    +:+
    -0.66
    POSITIVE LOGITS
    helpers
    0.45
     exactly
    0.44
    s
    0.44
     came
    0.43
    m
    0.43
    uern
    0.42
    jsii
    0.41
     برانيه
    0.41
    l
    0.40
     Int
    0.40
    Act Density 0.296%

    No Known Activations