INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    ج
    2.30
    т
    1.91
    atl
    1.70
    1.55
    ق
    1.52
     Диа
    1.47
    yout
    1.44
    ف
    1.42
    1.42
    1.40
    POSITIVE LOGITS
    ALITY
    2.09
     dakika
    1.98
     progenitor
    1.91
    ALK
    1.87
    1.86
    Finger
    1.84
    1.84
    1.82
    就像
    1.78
    1.78
    Act Density 0.006%

    No Known Activations