INDEX
    Explanations

    adjectives and verbs related to physical appearance and actions

    phrases indicating conditions, assessments, or choices

    New Auto-Interp
    Negative Logits
     ______
    -0.62
    .''.
    -0.57
     âĢİ
    -0.54
    ¶
    -0.53
     antioxid
    -0.53
     gmaxwell
    -0.53
     thence
    -0.53
    -0.52
     pursuant
    -0.52
     Jihad
    -0.52
    POSITIVE LOGITS
    Đ
    1.19
    ù
    1.19
     RandomRedditor
    1.19
    Ă
    1.19
    ă
    1.19
    ø
    1.19
    đ
    1.19
    ė
    1.19
    Ě
    1.19
    û
    1.19
    Act Density 1.515%

    No Known Activations