INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    ع
    1.75
    ل
    1.73
    to
    1.70
    1.36
    ج
    1.32
    H
    1.30
    ك
    1.30
    이지만
    1.27
    ب
    1.25
    خ
    1.23
    POSITIVE LOGITS
     Bear
    1.33
    ,
    1.13
     bear
    1.05
     I
    1.04
    ite
    0.96
     
    0.93
    0.93
     Section
    0.93
     Road
    0.92
     Soviet
    0.86
    Act Density 0.003%

    No Known Activations