INDEX
    Explanations

    comparative terms indicating superiority or performance, especially in a political or economic context

    New Auto-Interp
    Negative Logits
    fal
    -0.16
    yclopedia
    -0.16
    cete
    -0.15
    icional
    -0.14
    zia
    -0.14
    duk
    -0.14
    ette
    -0.14
    ampo
    -0.14
    ÙıÙĪØ§
    -0.14
    ulur
    -0.14
    POSITIVE LOGITS
     even
    0.28
    even
    0.22
     any
    0.22
     даже
    0.19
     than
    0.18
    çĶļèĩ³
    0.18
     EVEN
    0.17
     ever
    0.17
     anything
    0.17
     mere
    0.17
    Act Density 0.147%

    No Known Activations