INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     shutil
    -0.08
    iaid
    -0.07
    cing
    -0.07
    koj
    -0.07
     Ausstellung
    -0.07
     Fug
    -0.07
    代理
    -0.07
     manuals
    -0.07
     swiss
    -0.07
    Shapes
    -0.07
    POSITIVE LOGITS
    ROLE
    0.08
     начале
    0.08
     ROLE
    0.08
     -*-
    0.08
    0.08
    PHONE
    0.08
     Celt
    0.08
    &nbsp
    0.08
     groundwork
    0.08
     jednym
    0.07
    Act Density 0.002%

    No Known Activations