INDEX
    Explanations

    references to individuality and individualism

    New Auto-Interp
    Negative Logits
    dden
    -0.17
    ylon
    -0.16
    ArgumentException
    -0.15
    گاÙĩ
    -0.15
    bulan
    -0.15
    ÙĤÛĮ
    -0.15
    oning
    -0.14
    ibilities
    -0.14
    iero
    -0.14
    нд
    -0.14
    POSITIVE LOGITS
    ized
    0.23
    /groups
    0.20
    /group
    0.20
    ity
    0.19
    itarian
    0.19
    ize
    0.19
    /single
    0.18
    ités
    0.18
    mente
    0.18
    /team
    0.18
    Act Density 0.029%

    No Known Activations