INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     Somalia
    -0.09
    ంలోని
    -0.09
     zona
    -0.08
     Zona
    -0.08
    سط
    -0.07
     Somali
    -0.07
     pitanja
    -0.07
     superfície
    -0.07
    Zona
    -0.07
     shaped
    -0.07
    POSITIVE LOGITS
     afore
    0.10
    athe
    0.09
    liwe
    0.09
    logen
    0.08
    0.08
     organising
    0.08
    λό
    0.07
    burn
    0.07
     offender
    0.07
     TOKEN
    0.07
    Act Density 0.002%

    No Known Activations