INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    üler
    0.76
    ot
    0.66
    os
    0.66
    eer
    0.66
    ol
    0.64
     glor
    0.63
    ";
    0.63
     (
    0.62
    ală
    0.61
    是用
    0.60
    POSITIVE LOGITS
    на
    0.88
    ف
    0.81
    ب
    0.80
    ат
    0.75
    اب
    0.73
    ید
    0.73
    за
    0.72
    Coronavirus
    0.72
    م
    0.69
    pandemic
    0.68
    Act Density 0.004%

    No Known Activations