INDEX
    Explanations

    words and phrases that convey a sense of danger or negativity

    New Auto-Interp
    Negative Logits
    GIVEREF
    -0.73
    日閲覧
    -0.58
    RefNanny
    -0.58
    __':
    
    -0.57
    setcounter
    -0.56
    enterOuterAlt
    -0.56
    addPreferredGap
    -0.55
     muhimu
    -0.54
    Biblia
    -0.54
    djangoproject
    -0.52
    POSITIVE LOGITS
     sounding
    0.79
    erweise
    0.76
    sounding
    0.71
    ly
    0.69
     situations
    0.69
    ness
    0.65
     looking
    0.65
    looking
    0.64
     behavior
    0.63
     behaviour
    0.62
    Act Density 0.458%

    No Known Activations