INDEX
    Explanations

    instances of negation and expressions of skepticism

    Negative Logits
    tagHelper           -0.38
     تانيه              -0.38
     pasos              -0.35
     wiedzy             -0.35
     timeval            -0.35
     Mischung           -0.35
     Gemeinschaft       -0.34
     Glauben            -0.33
     kochen             -0.33
    pyx                 -0.32

    Positive Logits
    #+#                 0.62
     autorytatywna      0.59
    imachinery          0.58
     newOwner           0.57
    expandindo          0.57
    脚注の使い方          0.56
     disambiguazione    0.56
    хьтан               0.55
     Italijani          0.55
    ✨:                 0.55

    Activation Density: 0.537%

    No Known Activations