INDEX
    Explanations

    elements related to documentation and reporting

    New Auto-Interp
    Negative Logits
    аÑĢаÑĤ
    -0.17
    ÏĦικα
    -0.15
    bers
    -0.14
    .WinForms
    -0.14
     substit
    -0.14
    ser
    -0.14
     gent
    -0.14
    agna
    -0.14
     Rockefeller
    -0.14
    Shoot
    -0.14
    POSITIVE LOGITS
    roje
    0.16
    anzi
    0.16
    plat
    0.15
    	icon
    0.14
     elevation
    0.14
     icon
    0.14
    icon
    0.13
    ени
    0.13
    chten
    0.13
     naken
    0.13
    Act Density 0.031%

    No Known Activations