INDEX
    Explanations

    ostensibly/ostentatious

    Negative Logits
    央企                    -0.08
     inducing               -0.07
    酿酒                    -0.07
    ränk                    -0.07
    Hands                   -0.07
     esteemed               -0.06
    agy                     -0.06
    (token not rendered)    -0.06
    rios                    -0.06
     quận                   -0.06
    Positive Logits
     Kul                    0.07
    (token not rendered)    0.07
    phot                    0.07
     ########.              0.07
    querySelector           0.07
     optimistic             0.06
    itial                   0.06
    (token not rendered)    0.06
    особ                    0.06
    итель                   0.06
    Activation Density: 0.001%

    No Known Activations