INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    .microsoft
    -0.07
    -0.07
     karma
    -0.07
     integr
    -0.06
     irritation
    -0.06
     الك
    -0.06
     toile
    -0.06
     kure
    -0.06
     tractors
    -0.06
     الج
    -0.06
    POSITIVE LOGITS
    mana
    0.09
    Jen
    0.09
    manes
    0.08
    五星
    0.08
    June
    0.08
    ongen
    0.08
     Jen
    0.08
    _Framework
    0.08
     Higgins
    0.07
    Ware
    0.07
    Act Density 0.025%

    No Known Activations