INDEX
    Explanations
    NEGATIVE LOGITS
    " for"      1.02
    "на"        0.80
    ">"         0.80
    "<0x80>"    0.77
    "١"         0.75
    "u"         0.73
    (blank)     0.73
    "AZIONE"    0.72
    "w"         0.71
    "ча"        0.70
    POSITIVE LOGITS
    " "         0.82
    " is"       0.76
    " उमर"      0.73
    (blank)     0.71
    "ïne"       0.66
    " was"      0.66
    (blank)     0.65
    "osevelt"   0.65
    "hton"      0.65
    " Ulster"   0.65
    Act Density 0.002%

    No Known Activations