    Explanations

    pronouns referring to groups of people

    Negative Logits
    Token            Logit
    "onal"           -0.78
    "heny"           -0.71
    "rought"         -0.65
    "emis"           -0.64
    " Federation"    -0.63
    "ion"            -0.62
    " Mub"           -0.61
    "olid"           -0.61
    "wire"           -0.61
    "microsoft"      -0.58
    Positive Logits
    Token               Logit
    "é¾įåĸļ士"         0.99
    "selves"            0.78
    "ternally"          0.75
    "ortium"            0.73
    "DragonMagazine"    0.70
    "çīĪ"               0.69
    "gypt"              0.68
    "ãģ¯"               0.68
    "imei"              0.67
    "æ³"                0.67
    Activation Density: 0.368%

    No Known Activations