INDEX
    Explanations

    terms related to children and their experiences

    New Auto-Interp
    Negative Logits
    جع
    -0.17
    amente
    -0.16
    gether
    -0.15
    ilt
    -0.15
    UDA
    -0.15
    aravel
    -0.14
    uet
    -0.14
    atively
    -0.14
    ulumi
    -0.14
    ative
    -0.14
    POSITIVE LOGITS
    s
    0.35
     who
    0.32
    /y
    0.29
    /gr
    0.28
     whom
    0.27
    /ad
    0.27
    ImageSharp
    0.26
    nap
    0.26
    eren
    0.25
     aged
    0.25
    Act Density 0.080%

    No Known Activations