INDEX
    Explanations
    Negative Logits
    Token | Logit
     flat | -0.06
    “Oh | -0.06
     purchaser | -0.06
      | -0.06
    "A | -0.06
     nineteen | -0.06
    "And | -0.06
     mills | -0.06
    	want | -0.06
    (proj | -0.06
    Positive Logits
    Token | Logit
    ẫu | 0.07
    anoia | 0.07
     | 0.07
    hec | 0.06
     cola | 0.06
     مقاو | 0.06
     دستور | 0.06
    เคราะห | 0.06
    ість | 0.06
     Вот | 0.06
    Activation Density: 0.067%

    No Known Activations