INDEX
    Explanations

    instances of the word "you"

    New Auto-Interp
    Negative Logits
    phy
    -0.15
     suit
    -0.15
    igan
    -0.14
    azzo
    -0.14
    adele
    -0.14
     jun
    -0.14
     Suit
    -0.14
    âĸį
    -0.14
     REFER
    -0.14
    urum
    -0.14
    POSITIVE LOGITS
    agoon
    0.17
    unger
    0.16
     ########.
    0.15
    bots
    0.14
     Depth
    0.14
    isons
    0.14
     depth
    0.14
    xito
    0.14
    icorn
    0.14
    ullan
    0.13
    Act Density 0.031%

    No Known Activations