INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     proprietary
    -0.08
    .admin
    -0.07
     documents
    -0.07
     homework
    -0.07
    .orig
    -0.07
    RIC
    -0.07
    	Document
    -0.07
    Translations
    -0.07
    .permissions
    -0.07
     prescriptions
    -0.07
    POSITIVE LOGITS
     vertically
    0.11
     arranged
    0.11
     horizontally
    0.11
     directional
    0.10
     вертик
    0.10
     scattered
    0.10
     scatter
    0.10
    0.09
     perched
    0.09
     размест
    0.09
    Act Density 0.018%

    No Known Activations