INDEX
    Explanations

    personal pronouns and possessive pronouns

    New Auto-Interp
    Negative Logits
     sugg
    -0.78
    ikk
    -0.66
    inqu
    -0.61
    adish
    -0.61
    ksh
    -0.60
    VEN
    -0.59
     Inqu
    -0.59
     venture
    -0.59
    ule
    -0.58
    isure
    -0.56
    POSITIVE LOGITS
    png
    0.83
     signs
    0.75
     resemblance
    0.74
     versatility
    0.72
     plainly
    0.72
     glimps
    0.72
    ibilities
    0.72
     naked
    0.70
     clearly
    0.70
    formance
    0.70
    Act Density 0.138%

    No Known Activations