INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    ра
    0.78
    ला
    0.75
    نا
    0.72
    ના
    0.66
    0.66
    ка
    0.64
    이는
    0.64
    ión
    0.63
    ها
    0.63
    0.62
    POSITIVE LOGITS
    0
    1.16
    3
    1.03
     antibodies
    0.99
    4
    0.97
    1
    0.95
    6
    0.92
    5
    0.86
     Antibodies
    0.83
    H
    0.80
    ت
    0.79
    Act Density 0.002%

    No Known Activations