INDEX
    Explanations

    describing people

    New Auto-Interp
    Negative Logits
     cucumber
    -0.06
    PECIAL
    -0.06
    -Al
    -0.06
    partners
    -0.06
     jim
    -0.06
    _female
    -0.06
     IBM
    -0.06
     cute
    -0.06
    clide
    -0.06
    rary
    -0.06
    POSITIVE LOGITS
     توسط
    0.07
    scientific
    0.06
     غير
    0.06
     Slide
    0.06
     ประเภท
    0.06
    _this
    0.06
     broadcasting
    0.06
    collection
    0.06
    ası
    0.06
     Average
    0.06
    Act Density 0.142%

    No Known Activations