INDEX
    Explanations

    pronouns and their relationships to the subjects and actions in the text

    Negative Logits
    resco       -0.16
    illard      -0.15
    udget       -0.14
    áj          -0.14
     Mou        -0.14
    lettes      -0.13
    urve        -0.13
    elow        -0.13
    ailability  -0.13
    elters      -0.13
    Positive Logits
    ilha        0.14
    missive     0.14
    483         0.14
    icone       0.14
    renom       0.14
    nder        0.14
    ileged      0.13
    ãģıãģł      0.13
    omit        0.13
    imoto       0.13
    Act Density 0.069%

    No Known Activations