    NEGATIVE LOGITS
    ичество      -0.06
     Exam        -0.06
     quitting    -0.06
     logos       -0.06
     Cohen       -0.06
    [color       -0.06
     Tests       -0.06
    	items       -0.06
    circ         -0.06
     Cos         -0.06
    POSITIVE LOGITS
    Keyboard     0.07
    .CO          0.07
     ('\         0.07
    izabeth      0.07
    -aged        0.07
    scheduled    0.07
    :''          0.07
     NET         0.06
     prov        0.06
    ULER         0.06
    Activation Density: 1.957%

    No Known Activations