INDEX
    Explanations

    concepts related to community responsibility and personal accountability

    New Auto-Interp
    Negative Logits
    rese
    -0.16
    ikit
    -0.14
    arel
    -0.14
    icie
    -0.14
    haps
    -0.14
    chang
    -0.14
    lasses
    -0.14
     really
    -0.14
    arge
    -0.13
    άÏĤ
    -0.13
    POSITIVE LOGITS
     shall
    0.20
     Shall
    0.19
     colleg
    0.16
     personal
    0.16
     treat
    0.16
    fair
    0.16
     wherever
    0.15
    onest
    0.15
     honest
    0.15
     reasonably
    0.15
    Act Density 0.119%

    No Known Activations