INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    lie
    -0.08
     ingress
    -0.08
     Nissan
    -0.08
    iën
    -0.07
     liang
    -0.07
    lander
    -0.07
     Dear
    -0.07
     Tavern
    -0.07
     ارت
    -0.07
    sab
    -0.07
    POSITIVE LOGITS
     collage
    0.14
    制作
    0.11
     stitching
    0.09
     scrapbook
    0.09
    0.09
    -making
    0.09
    -maker
    0.09
    orama
    0.09
    0.09
    写真
    0.08
    Act Density 0.003%

    No Known Activations