INDEX
    Explanations

    temporal references or dates

    Numbers within dates and times

    dates and unique identifiers

    New Auto-Interp
    Negative Logits
    تقاوى
    -0.79
    andExpect
    -0.77
    enumii
    -0.75
     tartalomajánló
    -0.72
     Мексичка
    -0.70
    RTLI
    -0.70
    참고
    -0.69
     beginnetje
    -0.69
     mergeFrom
    -0.69
     ISNI
    -0.69
    POSITIVE LOGITS
    ↵↵
    0.63
    <eos>
    0.51
    UVWXYZ
    0.48
     betweenstory
    0.46
     Ucraina
    0.45
     года
    0.43
    literals
    0.41
    0.40
    ال
    0.40
     luft
    0.39
    Act Density 0.135%

    No Known Activations