    Explanations

    sentences that conclude with a period

    Negative Logits
     autorytatywna
    -0.79
    χρι
    -0.72
    ManyToMany
    -0.68
    Попис
    -0.64
     Trouvez
    -0.63
     BorderRadius
    -0.62
     ویکی‌پدیا
    -0.61
    تقاوى
    -0.61
    Notae
    -0.60
    principalColumn
    -0.60
    Positive Logits
     sito
    0.55
    validates
    0.54
    )
    0.53
     safety
    0.53
     Safety
    0.53
    LOU
    0.52
    })
    0.52
    ։
    0.52
     rito
    0.51
    dese
    0.51
    Act Density 0.377%

    No Known Activations