INDEX
    Explanations

    words and phrases that express high intensity or emphasis

    New Auto-Interp
    Negative Logits
    owie
    -0.16
    mana
    -0.14
    voir
    -0.14
    orde
    -0.14
    486
    -0.13
    ائÙĦ
    -0.13
    etwork
    -0.13
     Fist
    -0.13
    ALAR
    -0.13
    anian
    -0.13
    POSITIVE LOGITS
    zik
    0.16
    abus
    0.16
    Instr
    0.15
    à¸Ńà¸ļ
    0.14
    리ìĹIJ
    0.14
    ноп
    0.14
    .BLL
    0.14
    ÙĤاÙħ
    0.14
    aepernick
    0.14
    ritel
    0.14
    Act Density 0.013%

    No Known Activations