INDEX
    Explanations

    instances of pronouns and their associated actions or states

    New Auto-Interp
    Negative Logits
    oppel
    -0.15
    opsis
    -0.15
     Semester
    -0.14
    uzzi
    -0.14
    во
    -0.14
    KF
    -0.14
    û
    -0.14
    bf
    -0.13
    θεν
    -0.13
    stvo
    -0.13
    POSITIVE LOGITS
    amen
    0.16
    atham
    0.14
    inding
    0.14
    à¹Ģà¸Ĺ
    0.14
     cite
    0.14
    anted
    0.14
     dik
    0.14
     Miss
    0.14
    REATE
    0.14
    Scoped
    0.13
    Act Density 0.037%

    No Known Activations