INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    	margin
    -0.07
     Matlab
    -0.06
     invoices
    -0.06
     calculation
    -0.06
     potassium
    -0.06
    ountains
    -0.06
     EUR
    -0.06
    innen
    -0.06
    falls
    -0.06
     matlab
    -0.05
    POSITIVE LOGITS
    0.07
    ить
    0.07
    elly
    0.07
     SAT
    0.06
     الكتاب
    0.06
     Để
    0.06
    يتي
    0.06
     rogue
    0.06
    hack
    0.06
    шие
    0.06
    Act Density 0.002%

    No Known Activations