INDEX
    Explanations

    French articles and pronouns

    New Auto-Interp
    Negative Logits
     Theſe
    -0.98
     themſelves
    -0.93
     himſelf
    -0.88
    BibitemShut
    -0.88
     ―――――
    -0.87
     ſind
    -0.85
     iconFacebook
    -0.84
    ſelves
    -0.82
     ſeveral
    -0.82
    DeleteBehavior
    -0.82
    POSITIVE LOGITS
     la
    1.10
     le
    0.77
     M
    0.75
     La
    0.72
     L
    0.69
    BorderFactory
    0.69
    La
    0.68
    員の
    0.66
    la
    0.65
     J
    0.65
    Act Density 0.073%

    No Known Activations