INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     hari
    -0.09
     abaf
    -0.08
     feminine
    -0.08
     Wife
    -0.08
     tane
    -0.08
    Presentation
    -0.07
     bih
    -0.07
     presentation
    -0.07
     waz
    -0.07
     lob
    -0.07
    POSITIVE LOGITS
    -knit
    0.11
     cohesion
    0.10
    0.10
     solidarity
    0.10
     resilience
    0.10
     solidaridad
    0.09
     solidar
    0.09
     اجتماعی
    0.09
    જૂ
    0.09
     solidarité
    0.09
    Act Density 0.013%

    No Known Activations