    Explanations

    personal pronouns or possessive pronouns

    references to personal experiences or opinions expressed as questions and statements

    Negative Logits
        Token           Logit
        " ILCS"         -0.87
        "ateral"        -0.69
        "ï¸ı"           -0.68
        " Pole"         -0.67
        "planes"        -0.65
        "achus"         -0.65
        "ioxide"        -0.62
        "apixel"        -0.60
        "axy"           -0.60
        " Waters"       -0.60

    Positive Logits
        Token           Logit
        " chose"        0.74
        " differed"     0.73
        " differs"      0.68
        " differ"       0.67
        " invoked"      0.64
        " sacrific"     0.64
        " practition"   0.64
        " staggered"    0.63
        " suspic"       0.62
        "called"        0.61

    Act Density 0.118%

    No Known Activations