    Explanations

    multilingual text; answering prompts

    Negative Logits
    (and           -0.09
     Numerous      -0.09
    .should        -0.08
     Schwe         -0.08
    <object        -0.08
    -enabled       -0.08
    NEY            -0.08
    -supported     -0.08
    Ns             -0.08
    SY             -0.08
    Positive Logits
    意义           0.10
     notion        0.09
    传统           0.09
     معنى          0.09
    meaning        0.09
     traditional   0.09
    意味           0.09
     concept       0.08
     notions       0.08
     nghĩa         0.08
    Activation Density: 0.013%

    No Known Activations