INDEX
    Explanations

    mentions of "Bart" and variations of the word "art."

    New Auto-Interp
    Negative Logits
    ÏĥÏī
    -0.15
    ock
    -0.15
    otherapy
    -0.14
    اعÙĬ
    -0.14
    kuk
    -0.14
    ignet
    -0.14
    eval
    -0.14
    osen
    -0.14
    ÑĥлÑı
    -0.14
    itz
    -0.14
    POSITIVE LOGITS
    AsStream
    0.16
    alars
    0.15
    ecast
    0.15
    urum
    0.15
    ereotype
    0.15
    opleft
    0.15
    abra
    0.14
     otherwise
    0.14
    edly
    0.14
    еÑĤа
    0.14
    Act Density 0.004%

    No Known Activations