INDEX
    Explanations

    phrases indicating quantities or counts

    New Auto-Interp
    Negative Logits
    CRET
    -0.14
    /apt
    -0.13
     cud
    -0.13
     пеÑĢеп
    -0.13
     base
    -0.13
    icot
    -0.13
    æķ´ä¸ª
    -0.13
     ust
    -0.13
    turnstile
    -0.13
    .Accessible
    -0.13
    POSITIVE LOGITS
     different
    0.17
     dozen
    0.15
     EVT
    0.15
    different
    0.15
     dalÅ¡ÃŃch
    0.14
    eração
    0.14
     sclerosis
    0.14
    adder
    0.14
    allis
    0.14
    tdown
    0.14
    Act Density 0.027%

    No Known Activations