INDEX
    Explanations

    intensifiers that convey strong emotions or opinions

    New Auto-Interp
    Negative Logits
    antro
    -0.17
    ãģĦãĤĭ
    -0.15
    izzo
    -0.14
    ÙĪØ²
    -0.14
    èŃ·
    -0.14
    ;base
    -0.14
    Ñħод
    -0.14
    à¥Īसल
    -0.14
    hea
    -0.14
    изнеÑģ
    -0.14
    POSITIVE LOGITS
    ething
    0.16
    ĶåĽŀ
    0.15
    quier
    0.14
     ανά
    0.13
    ارة
    0.13
    IDEO
    0.13
    ienes
    0.13
    reau
    0.13
    á»įng
    0.13
    YRO
    0.13
    Act Density 0.015%

    No Known Activations