INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     spawn
    -0.09
     synthetic
    -0.08
     manufactured
    -0.08
     patented
    -0.08
     imagens
    -0.08
    spawn
    -0.07
    Spawn
    -0.07
    .spawn
    -0.07
     engineered
    -0.07
    ’image
    -0.07
    POSITIVE LOGITS
    博客
    0.12
     ब्लॉग
    0.12
    (blog
    0.12
     blogging
    0.11
     Blogging
    0.11
     blogs
    0.11
    ブログ
    0.11
     bloggers
    0.11
    /blog
    0.11
     блог
    0.10
    Act Density 0.075%

    No Known Activations