INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     myſelf
    -1.60
     my
    -1.55
     myself
    -1.55
     meines
    -1.41
     mijn
    -1.40
     minhas
    -1.28
     meus
    -1.23
    my
    -1.18
     meinem
    -1.17
     mojej
    -1.17
    POSITIVE LOGITS
     sich
    0.55
    CloseOperation
    0.52
     .
    0.49
     other
    0.47
     us
    0.47
     ac
    0.44
     sub
    0.43
     “
    0.42
    例句
    0.42
     post
    0.41
    Act Density 0.212%

    No Known Activations