INDEX
    Negative Logits
    Token            Logit
    "bau"            -0.16
    " entirety"      -0.15
    "сього"          -0.14
    " большин"       -0.14
    "ArrayOf"        -0.14
    "izz"            -0.14
    "vie"            -0.13
    " самого"        -0.13
    " же"            -0.13
    "某"             -0.13
    Positive Logits
    Token            Logit
    " different"     0.31
    "different"      0.27
    " dozen"         0.25
    "不同的"         0.23
    " sclerosis"     0.19
    "Different"      0.18
    "不同"           0.17
    " differently"   0.17
    " Different"     0.17
    " other"         0.16
    Activation Density: 0.067%

    No Known Activations