INDEX
    Explanations

    ethnicity, native, robust child

    New Auto-Interp
    Negative Logits
     própria
    0.45
     informacion
    0.44
     الدولة
    0.44
     atmósfera
    0.39
     their
    0.38
     noticia
    0.38
     plataformas
    0.38
     testimonies
    0.38
     próprios
    0.38
     garantiza
    0.38
    POSITIVE LOGITS
    י
    0.42
    т
    0.41
    0.40
    गाह
    0.39
     kyll
    0.39
    राग
    0.39
    Wil
    0.39
     Instant
    0.38
    तीत
    0.38
    ırd
    0.38
    Act Density 0.003%

    No Known Activations