INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    _FILES
    -0.08
    -haired
    -0.08
    _MATRIX
    -0.08
     FILE
    -0.07
     يعلم
    -0.07
     campaign
    -0.07
    _UNLOCK
    -0.07
    /')↵
    -0.07
    unterricht
    -0.07
    文件
    -0.07
    POSITIVE LOGITS
    Sentence
    0.11
     modifiers
    0.10
     sentence
    0.10
     ifad
    0.10
     Sentence
    0.10
    sentence
    0.10
    _sentence
    0.10
     expresa
    0.09
    一句
    0.09
    Modifiers
    0.09
    Act Density 0.020%

    No Known Activations