INDEX
    Explanations

    punctuation marks at the end of sentences

    New Auto-Interp
    Negative Logits
    expandindo
    -0.87
     ویکی‌پدیای
    -0.82
     defaultstate
    -0.79
     дописавши
    -0.79
    uxxxx
    -0.75
    ArrowToggle
    -0.75
     autorytatywna
    -0.74
     سكانية
    -0.71
     estimés
    -0.71
     مشين
    -0.71
    POSITIVE LOGITS
    5
    0.61
    0
    0.60
    0.58
    9
    0.56
     ongles
    0.55
    3
    0.54
    1
    0.54
    4
    0.53
    chließ
    0.52
    th
    0.52
    Act Density 0.261%

    No Known Activations