INDEX
    Explanations

    signs manifestations

    New Auto-Interp
    Negative Logits
     Species
    -0.07
    getDate
    -0.07
    pecies
    -0.06
     شناسی
    -0.06
     Giles
    -0.06
    	mat
    -0.06
    работ
    -0.06
    quette
    -0.06
     Feed
    -0.06
    resden
    -0.06
    POSITIVE LOGITS
    authors
    0.07
     Dwight
    0.07
    >{@
    0.06
    0.06
     der
    0.06
    ***↵↵
    0.06
    .bootstrap
    0.06
     виб
    0.06
     Прот
    0.06
    _An
    0.06
    Act Density 0.075%

    No Known Activations