INDEX
    Explanations

    expressions of disappointment or dissatisfaction related to new developments or changes

    Negative Logits
    "ریه"       -0.16
    " bump"     -0.16
    " spit"     -0.15
    "hang"      -0.15
    "øj"        -0.15
    "quil"      -0.15
    "strup"     -0.14
    "veau"      -0.14
    ".sy"       -0.14
    "add"       -0.14
    Positive Logits
    " aside"        0.27
    "aside"         0.18
    " Aside"        0.18
    "uptools"       0.18
    "=set"          0.18
    "parameters"    0.18
    ":set"          0.17
    "embro"         0.17
    "tle"           0.17
    " sail"         0.17
    Activation Density: 0.065%

    No Known Activations