INDEX
    Explanations

    words related to incentives or disincentives

    words and concepts related to incentives or motivations

    New Auto-Interp
    Negative Logits
     incap
    -0.67
     answ
    -0.66
    ŃĶ
    -0.65
    fac
    -0.64
    ¾
    -0.64
     Citiz
    -0.64
    Fac
    -0.60
    Ī
    -0.59
    ipeg
    -0.59
    Scient
    -0.57
    POSITIVE LOGITS
    hips
    0.98
    oft
    0.96
    heet
    0.94
    hare
    0.89
    ilver
    0.89
    cript
    0.89
    liga
    0.85
    paces
    0.84
    terday
    0.84
    ync
    0.82
    Act Density 0.141%

    No Known Activations