INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    coat
    -0.07
     Innoc
    -0.07
     mechanical
    -0.07
    -0.07
     جون
    -0.07
     kont
    -0.06
     nickel
    -0.06
    zent
    -0.06
     dni
    -0.06
    avic
    -0.06
    POSITIVE LOGITS
     spread
    0.14
     Spread
    0.12
     spreading
    0.11
     spreads
    0.10
    Spread
    0.09
    spread
    0.08
    0.07
     marketing
    0.07
    patterns
    0.07
    	ID
    0.07
    Act Density 0.013%

    No Known Activations