INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    inden
    -0.08
    -0.08
    habil
    -0.07
     mostly
    -0.07
    placer
    -0.07
    513
    -0.07
    chef
    -0.07
    وط
    -0.07
     tutorial
    -0.07
     مہ
    -0.07
    POSITIVE LOGITS
    0.08
    .Agent
    0.08
     Bloom
    0.08
     microsc
    0.08
     Panther
    0.08
    -events
    0.08
    0.08
     Franz
    0.07
     Gartner
    0.07
     Offices
    0.07
    Act Density 0.002%

    No Known Activations