INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     mine
    -0.08
     penn
    -0.07
     leap
    -0.07
    Writer
    -0.07
    Mate
    -0.07
     feed
    -0.07
     cage
    -0.07
    -0.07
     mines
    -0.07
     feeding
    -0.07
    POSITIVE LOGITS
    0.09
     dominante
    0.09
     الهند
    0.08
     sabores
    0.08
    domin
    0.08
    deith
    0.08
     الاج
    0.08
    נית
    0.08
     कौन
    0.08
     predomin
    0.08
    Act Density 0.001%

    No Known Activations