INDEX
    Explanations

    positive expressions or remarks

    New Auto-Interp
    Negative Logits
     periodically
    -0.65
    pour
    -0.55
    bil
    -0.55
    lia
    -0.53
     increment
    -0.52
     periodic
    -0.52
     corpus
    -0.51
     utility
    -0.51
     exit
    -0.50
     backdoor
    -0.50
    POSITIVE LOGITS
    efe
    0.72
    quite
    0.63
     dreamed
    0.60
     matched
    0.59
    ented
    0.59
    quickShipAvailable
    0.57
    ħĭ
    0.57
     afforded
    0.56
     hotter
    0.56
    bleacher
    0.55
    Act Density 0.209%

    No Known Activations