INDEX
    Explanations

    elements related to observational commentary or opinion expressions

    New Auto-Interp
    Negative Logits
    efined
    -0.17
    ial
    -0.16
    htt
    -0.15
    uty
    -0.15
    ses
    -0.14
    upertino
    -0.14
    esan
    -0.14
    ниÑĨÑĭ
    -0.14
    ASS
    -0.14
    aspers
    -0.14
    POSITIVE LOGITS
    Binder
    0.15
    azu
    0.14
     synchronized
    0.14
    ÛĮز
    0.14
     Jame
    0.14
    alles
    0.13
    vla
    0.13
    kop
    0.13
    bla
    0.13
    place
    0.13
    Act Density 0.114%

    No Known Activations