INDEX
    Explanations

    personal possessive pronouns followed by specific nouns

    plural forms of the letter 's' in different contexts

    New Auto-Interp
    Negative Logits
     Hasan
    -0.72
     Presidents
    -0.72
     indemn
    -0.71
     boycot
    -0.67
     Grateful
    -0.62
     Suns
    -0.61
     Islamic
    -0.60
     Republican
    -0.59
     Ping
    -0.59
     Sinn
    -0.58
    POSITIVE LOGITS
    pecially
    1.14
    lightly
    1.13
    atisf
    1.11
    uddenly
    1.05
    ELF
    1.05
    ustainable
    1.02
    ources
    0.99
    omew
    0.99
    ustain
    0.98
    ̶
    0.94
    Act Density 0.303%

    No Known Activations