INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    ковод
    -0.07
    	buff
    -0.07
     Franco
    -0.07
    gay
    -0.07
     nors
    -0.07
    BUFFER
    -0.06
    _movie
    -0.06
    นต
    -0.06
    road
    -0.06
    _tra
    -0.06
    POSITIVE LOGITS
     apex
    0.10
     Apex
    0.09
     zenith
    0.08
    ISE
    0.07
    MaxY
    0.07
    Edge
    0.07
     Pyramid
    0.07
    .centerY
    0.07
     bitterness
    0.07
    Z
    0.06
    Act Density 0.002%

    No Known Activations