    Negative Logits
    Token         Logit
    " бизнес"     -0.07
    " işe"        -0.06
    "ملكة"        -0.06
    "ience"       -0.06
    "يلاد"        -0.06
    "Houston"     -0.06
    " erotisk"    -0.06
    " rice"       -0.06
    " weiber"     -0.06
    Positive Logits
    Token         Logit
    " Mat"        0.14
    "Mat"         0.13
    " mat"        0.13
    " Mats"       0.12
    " MAT"        0.11
    "mat"         0.10
    "Matthew"     0.10
    " Matt"       0.10
    " mats"       0.10
    " Matthew"    0.10
    Activation Density: 0.011%

    No Known Activations