INDEX
    Explanations

    words associated with conflict, visibility, and negative emotions

    revealing secrets/feelings

    Negative Logits
    })->                -0.62
    ConfigureAwait      -0.62
    ,:);                -0.61
    "}}                 -0.61
    })();\n             -0.59
     Verd               -0.59
     |\n                -0.58
    ();}                -0.57
     Kars               -0.56
     ❖                  -0.56
    Positive Logits
    ValueStyle          0.56
    addCriterion        0.52
    дото                0.51
    ugin                0.44
     possibilité        0.43
    oreille             0.43
     Initialized        0.43
    ...                 0.43
    refundable          0.42
    httphttps           0.42
    Activation Density: 1.638%

    No Known Activations