INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     gratitude
    -0.06
    arks
    -0.06
     spat
    -0.06
    tatus
    -0.06
     wanting
    -0.06
    -0.06
     object
    -0.06
     بده
    -0.06
    ilenames
    -0.06
     Plant
    -0.06
    POSITIVE LOGITS
    	glVertex
    0.07
     floppy
    0.06
    	rs
    0.06
    Recipe
    0.06
    _featured
    0.06
     nec
    0.06
     Worldwide
    0.06
     goalie
    0.06
    ちょ
    0.06
     Toggle
    0.06
    Act Density 0.010%

    No Known Activations