INDEX
    Explanations

    possessive pronouns and their associated references

    New Auto-Interp
    Negative Logits
    gre
    -0.19
    urs
    -0.17
    ady
    -0.15
    urette
    -0.15
    uali
    -0.14
    alker
    -0.14
    _Instance
    -0.14
     borderTop
    -0.14
    olang
    -0.14
    ohana
    -0.14
    POSITIVE LOGITS
    ë§IJ
    0.15
    eyin
    0.15
    ROME
    0.14
     Marsh
    0.14
     McCl
    0.14
    ops
    0.13
    stras
    0.13
    aphore
    0.13
    .Cmd
    0.13
    &E
    0.13
    Act Density 0.076%

    No Known Activations