INDEX
    Explanations

    Mentions of the speaker or a first-person perspective, especially the speaker referring to themselves

    Negative Logits
    "paralle"          -0.76
    "pole"             -0.73
    "raper"            -0.69
    "medium"           -0.66
    "kefeller"         -0.66
    "itialized"        -0.64
    "*/("              -0.64
    "edged"            -0.63
    " pend"            -0.63
    " fragmentation"   -0.62
    Positive Logits
    "asures"    1.18
    "asured"    1.10
    "cca"       1.09
    "asuring"   1.08
    "eting"     1.06
    "asure"     1.06
    "aning"     1.05
    "zzo"       1.05
    "adows"     0.95
    "adow"      0.90
    Activation Density: 0.015%

    No Known Activations