INDEX
    Explanations

    scientific/technical texts

    New Auto-Interp
    Negative Logits
    åįı
    -0.28
    äºĨèĩªå·±çļĦ
    -0.26
    éĩį
    -0.26
    iangle
    -0.26
    ç°Į
    -0.25
    =image
    -0.25
    нÑĮ
    -0.24
     pé
    -0.24
    _CP
    -0.23
    éĩįéĩijå±ŀ
    -0.23
    POSITIVE LOGITS
    edList
    0.31
    nums
    0.28
    è¾¾
    0.27
    asan
    0.27
     unders
    0.26
    为主çļĦ
    0.25
    æīĵéĢļ
    0.25
    plevel
    0.25
    åĬĽè¿ĺæĺ¯
    0.25
    å¤ĦçIJĨ
    0.24
    Act Density 0.017%

    No Known Activations