INDEX
    Explanations

    expressions related to feelings of discomfort or unease

    New Auto-Interp
    Head Attr Weights
    0:0.04
    1:0.03
    2:0.22
    3:0.05
    4:0.26
    5:0.04
    6:0.02
    7:0.03
    8:0.09
    9:0.09
    10:0.05
    11:0.03
    Negative Logits
    ardless
    -1.43
     Lich
    -1.42
     Aff
    -1.27
     Tsukuyomi
    -1.24
    hetical
    -1.21
     Kore
    -1.21
    hai
    -1.20
     Ess
    -1.18
     Earthquake
    -1.18
     affinity
    -1.17
    POSITIVE LOGITS
    itiveness
    1.60
    ewater
    1.48
     Alto
    1.35
    beck
    1.33
    bour
    1.33
    ths
    1.32
    opers
    1.30
    Ã
    1.30
     frogs
    1.20
    nir
    1.20
    Act Density 0.000%

    No Known Activations