INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    -0.08
    .connections
    -0.07
     Trojan
    -0.07
     Tal
    -0.07
     Winn
    -0.07
     talentos
    -0.07
     ek
    -0.07
     ciclos
    -0.07
     нар
    -0.07
     Veterans
    -0.07
    POSITIVE LOGITS
     shoreline
    0.09
     coastline
    0.09
    :number
    0.08
     kawasan
    0.08
     flown
    0.08
     fu
    0.08
    subtotal
    0.08
     شمال
    0.08
     Antalya
    0.08
     얼마
    0.08
    Act Density 0.007%

    No Known Activations