INDEX
    Explanations

    Biological/scientific text

    New Auto-Interp
    Negative Logits
    çļĦçĶ»éĿ¢
    -0.28
     mound
    -0.28
     alike
    -0.27
    æ®Ľ
    -0.26
    Flip
    -0.26
    acias
    -0.26
    ictions
    -0.26
     fences
    -0.25
     euler
    -0.25
     ///</
    -0.25
    POSITIVE LOGITS
    段
    0.32
    å±Ģ
    0.26
    æľŁ
    0.25
     syn
    0.24
     sympathetic
    0.24
     vá»įng
    0.24
    éĺ¶æ®µ
    0.24
     validation
    0.24
    é¾Ł
    0.24
    GIN
    0.23
    Act Density 0.604%

    No Known Activations