INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    ्स
    2.16
    yr
    2.09
    ي
    2.00
    2.00
    ς
    1.73
    ri
    1.71
    していますが
    1.70
    faz
    1.68
    1.68
    illation
    1.66
    POSITIVE LOGITS
    mies
    2.00
    1.98
    isasi
    1.97
    1.93
    бина
    1.92
    শ্বর
    1.91
    m
    1.88
    ล์
    1.85
     तुम्ह
    1.84
    ل
    1.83
    Act Density 0.011%

    No Known Activations