INDEX
    Explanations

    emotional expressions and personal reflections on experiences

    Negative Logits
    'VIRONMENT'     -0.73
    ' AppColors'    -0.73
    ' nakalista'    -0.70
    'indd'          -0.70
    '^(@)'          -0.69
    'bidden'        -0.67
    'elcome'        -0.67
    '++'            -0.66
    '%");'          -0.66
    ' @}'           -0.65
    Positive Logits
    ' they'         1.14
    ' thats'        1.13
    ' I'            1.09
    'they'          1.06
    ' it'           1.04
    ' we'           1.00
    ' you'          0.99
    'I'             0.97
    'thats'         0.94
    'this'          0.92
    Activation Density: 0.442%

    No Known Activations