INDEX
    Explanations

    terms related to human capabilities and desires

    New Auto-Interp
    Negative Logits
    iprot
    -0.78
    ikian
    -0.56
    </tfoot>
    -0.54
    tangentMode
    -0.54
    ///</
    -0.53
    sweise
    -0.53
    endregion
    -0.53
    ?}",
    -0.53
    positor
    -0.52
    MockBean
    -0.52
    POSITIVE LOGITS
     to
    0.87
    ในการ
    0.63
    tagHelperRunner
    0.60
     vilja
    0.56
    oa̍t
    0.55
     démocr
    0.55
     visant
    0.54
     riuscito
    0.53
     ability
    0.52
     popolare
    0.50
    Act Density 0.339%

    No Known Activations