INDEX
    Explanations

    phrases indicating a reduction or decrease in quantity or quality

    New Auto-Interp
    Negative Logits
    umu
    -0.15
    uild
    -0.14
     kostenlose
    -0.14
    ÙĨدر
    -0.14
    iaÅĤ
    -0.14
    REAK
    -0.13
    å¡
    -0.13
    ÄIJT
    -0.13
    outh
    -0.13
    .setView
    -0.13
    POSITIVE LOGITS
     than
    0.23
    -than
    0.22
    ening
    0.19
    _than
    0.18
    Than
    0.18
     než
    0.17
    eren
    0.17
    /no
    0.16
     THAN
    0.16
    ere
    0.16
    Act Density 0.028%

    No Known Activations