INDEX
    Explanations

    words related to personal pronouns

    New Auto-Interp
    Negative Logits
     ſtate
    -0.58
    DataLoader
    -0.56
     uſe
    -0.54
    vább
    -0.53
     DataLoader
    -0.53
     uſed
    -0.51
    
    -0.50
     purpoſe
    -0.50
    }[!
    -0.49
     poveznice
    -0.49
    POSITIVE LOGITS
     his
    1.20
    His
    1.07
     His
    1.06
     own
    0.96
    his
    0.94
     kanyang
    0.93
     their
    0.91
    Their
    0.91
     HIS
    0.90
     seiner
    0.90
    Act Density 0.512%

    No Known Activations