INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     regulators
    -0.07
     regulament
    -0.07
     thinner
    -0.07
    є
    -0.07
     nyl
    -0.07
     freshly
    -0.07
    illery
    -0.06
     regulator
    -0.06
    ér
    -0.06
    قاعد
    -0.06
    POSITIVE LOGITS
     midpoint
    0.09
    982
    0.08
    ატონ
    0.08
    առնալ
    0.08
    avg
    0.08
    AVG
    0.08
    իպ
    0.08
     között
    0.08
    Coords
    0.08
    	Add
    0.08
    Act Density 0.033%

    No Known Activations