INDEX
    Explanations

    positive sentiments and expressions of gratitude

    New Auto-Interp
    Negative Logits
     WWF
    -0.66
    ancel
    -0.61
     Greenpeace
    -0.59
    ammy
    -0.57
     mediation
    -0.56
     Griffin
    -0.56
     Emin
    -0.53
    winner
    -0.53
    ouses
    -0.53
    erman
    -0.52
    POSITIVE LOGITS
     Magicka
    0.69
    ":[
    0.64
    Bi
    0.63
    ."[
    0.61
    bler
    0.60
     dracon
    0.59
    enth
    0.59
     physically
    0.59
    igr
    0.58
    cells
    0.58
    Act Density 1.220%

    No Known Activations