INDEX
    Explanations

    expressions of boasting or bragging about achievements or qualities

    Negative Logits
    arkin       -0.18
    wie         -0.16
    eti         -0.15
    pData       -0.15
    apsed       -0.15
     Colomb     -0.15
    aise        -0.14
    unicode     -0.14
     Laws       -0.14
     Pearson    -0.14
    Positive Logits
    engu        0.16
    Ñĩи         0.15
    ably        0.14
    Builders    0.14
    .ci         0.14
    à¥Ĥद        0.14
    غÙĨ         0.14
    afari       0.14
    nest        0.13
    itia        0.13
    Act Density 0.009%

    No Known Activations