    Explanations

    words related to historical events or societal structures

    Negative Logits
     disagre            -1.84
     reluct             -1.82
     shenan             -1.76
     increa             -1.75
     depic              -1.75
     impra              -1.72
     encomp             -1.70
     affor              -1.70
     philanth           -1.70
     milf               -1.70
    Positive Logits
    <bos>               0.98
    SequentialGroup     0.66
     المعرف             0.65
     depending          0.63
     or                 0.63
     else               0.63
     oder               0.63
     etc                0.62
    ":[{                0.59
     หรือ               0.58
    Activation Density 0.842%

    No Known Activations