INDEX
    Explanations

    terms related to changes in status or position

    New Auto-Interp
    Negative Logits
     تانيه
    -0.82
     UnityEditor
    -0.61
    )_/¯
    -0.60
    SPATH
    -0.58
    tableFuture
    -0.58
    -0.58
    gatsby
    -0.58
     للمعارف
    -0.57
    wikidata
    -0.57
    يتر
    -0.56
    POSITIVE LOGITS
     pose
    0.74
     shift
    0.68
    فحة
    0.63
     put
    0.61
     move
    0.61
     shifts
    0.59
     Put
    0.59
     smile
    0.59
     Shift
    0.59
     Putting
    0.58
    Act Density 0.174%

    No Known Activations