INDEX
    Explanations

    phrases related to taking action or giving instructions

    references to collecting or managing items and the challenges associated with it

    Negative Logits
    " outwe"      -0.69
    " latter"     -0.60
    " Caucas"     -0.60
    " diam"       -0.59
    " describ"    -0.57
    " anecd"      -0.55
    " Niet"       -0.54
    "ighed"       -0.54
    "renheit"     -0.53
    " undermin"   -0.52
    Positive Logits
    " yourselves"   0.76
    " yourself"     0.74
    " Yourself"     0.74
    " âĢº"          0.73
    " !"            0.71
    " ]"            0.68
    " ye"           0.68
    " your"         0.66
    "¶"             0.65
    " Your"         0.65
    Activation Density: 0.690%

    No Known Activations