INDEX
    Explanations

    references to society and societal concepts

    New Auto-Interp
    Negative Logits
    -0.76
    mal
    -0.73
    ر
    -0.73
     ros
    -0.72
    cur
    -0.72
    un
    -0.69
    ar
    -0.68
    р
    -0.67
    f
    -0.67
     fla
    -0.64
    POSITIVE LOGITS
     Societies
    1.74
     societies
    1.68
     SOCIETY
    1.64
    Society
    1.63
     Society
    1.63
     society
    1.57
    society
    1.56
     Gesellschaft
    1.23
     sociedades
    1.18
     общество
    1.18
    Act Density 0.091%

    No Known Activations