INDEX
    Explanations
    NEGATIVE LOGITS
    Token             Logit
    " renom"          -0.07
    " aboard"         -0.07
    " celebrated"     -0.07
    ".TRAN"           -0.07
    " stained"        -0.07
    " financially"    -0.07
    "STONE"           -0.07
    " Stones"         -0.07
    "STER"            -0.07
    "/St"             -0.07
    POSITIVE LOGITS
    Token             Logit
    " paragraphs"     0.10
    " удел"           0.09
    " sección"        0.09
    " section"        0.09
    "Entr"            0.09
    "%左右"           0.08
    " besteed"        0.08
    " discussing"     0.08
    " deur"           0.08
    " էջ"             0.08
    Activation Density: 0.012%

    No Known Activations