    Explanations

    phrases that refer to a small number or a pair of items

    Negative Logits
    azo         -0.15
    ály         -0.14
    hi          -0.14
    ubi         -0.14
     Fri        -0.14
    noc         -0.13
    ollapsed    -0.13
    iol         -0.13
     Host       -0.13
     Starter    -0.13
    Positive Logits
     dozen      0.20
     misc       0.14
    DT          0.14
    leton       0.14
     of         0.14
    Skin        0.14
    ักก         0.13
     sal        0.13
    chip        0.13
    caff        0.13
    Activation Density 0.020%

    No Known Activations