    Explanations

    phrases related to the acquisition of knowledge or information

    Negative Logits
    ".proto"     -0.14
    " 큰"        -0.13
    "的一个"     -0.13
    "/on"        -0.13
    "regunta"    -0.13
    "ینه"        -0.13
    "筆"         -0.13
    "rz"         -0.13
    "ulumi"      -0.13
    "両"         -0.13
    Positive Logits
    " more"          0.52
    "more"           0.35
    " More"          0.31
    "更多"           0.30
    " everything"    0.29
    " about"         0.29
    " más"           0.28
    "_more"          0.27
    " 更"            0.27
    " mehr"          0.27
    Activation Density: 0.042%

    No Known Activations