INDEX
    Explanations

    imaginary unit and indices

    New Auto-Interp
    Negative Logits
    0.72
    ະລ
    0.68
     Menz
    0.65
    0.63
     cunt
    0.63
     чем
    0.63
    brus
    0.62
     وط
    0.62
    พลัง
    0.62
    غم
    0.62
    POSITIVE LOGITS
     i
    2.76
     İ
    2.20
    i
    2.15
    1.94
     í
    1.91
    1.87
    𝑖
    1.78
    1.75
    1.74
     아이
    1.74
    Act Density 1.543%

    No Known Activations