    Negative Logits
        " a"              0.91
        " I"              0.82
        "I"               0.76
        " It"             0.71
        " ("              0.69
        "A"               0.68
        " Epidemiology"   0.66
        " to"             0.65
        "↵↵"              0.62
        " Río"            0.61
    Positive Logits
        " σε"             1.13
        (not rendered)    1.09
        "و"               1.02
        (not rendered)    1.00
        (not rendered)    0.93
        " في"             0.90
        " در"             0.87
        "ق"               0.83
        (not rendered)    0.80
        "માં"             0.80
    Act Density 0.396%

    No Known Activations