INDEX
    Explanations

    instances of the word "we" and its variations

    New Auto-Interp
    Negative Logits
    cum
    -0.80
    Ore
    -0.65
    Posts
    -0.65
     Eleven
    -0.64
     Grav
    -0.63
     Coco
    -0.62
     Haku
    -0.60
     Clair
    -0.60
    photos
    -0.59
    Appearances
    -0.59
    POSITIVE LOGITS
    're
    1.19
    've
    1.19
    akening
    1.13
    eks
    1.05
     cannot
    1.00
    'll
    0.99
    athered
    0.99
     intend
    0.97
     believe
    0.96
     respectfully
    0.95
    Act Density 0.144%

    No Known Activations