INDEX
    Explanations

    pronouns ending with 'ou' and related words

    New Auto-Interp
    Negative Logits
    ãĥ¼ãĥ³
    -0.90
    alf
    -0.87
    Ö¼
    -0.86
    UTION
    -0.86
    ãĥŃ
    -0.85
    ãĤ¢ãĥ«
    -0.85
    é¾įå¥ij士
    -0.79
    女
    -0.78
    enegger
    -0.76
    70710
    -0.75
    POSITIVE LOGITS
    reth
    1.15
    lette
    1.09
    vre
    1.03
    seless
    1.01
    mi
    0.96
    ng
    0.96
    vernment
    0.96
    hou
    0.94
    ji
    0.93
    rette
    0.92
    Act Density 8.617%

    No Known Activations