INDEX
    Explanations

    references to the Chinese surname "Zhu" (represented as "zh" in the data)

    New Auto-Interp
    Negative Logits
     <<<<<<<<<<<<<<
    -0.72
    š
    -0.68
    Š
    -0.66
    ImageContext
    -0.65
     propOrder
    -0.65
     électriques
    -0.64
    InjectAttribute
    -0.63
    contentLoaded
    -0.60
    ArrowToggle
    -0.60
     sanitaires
    -0.60
    POSITIVE LOGITS
    zh
    3.17
    ZH
    1.87
    Zh
    1.70
     zh
    1.65
     Zh
    1.64
     ZH
    1.04
    zhi
    0.93
    zha
    0.86
    zhu
    0.77
    zhe
    0.76
    Act Density 0.002%

    No Known Activations