INDEX
    Explanations

    references to youth and their involvement in community or social activities

    Negative Logits
    icut        -0.16
    incinn      -0.16
    ister       -0.16
    ãĥ£         -0.16
    abant       -0.16
    asio        -0.16
    aco         -0.15
    ipsis       -0.15
    uyen        -0.14
    å§Ķ         -0.14
    Positive Logits
    quake       0.21
    neys        0.18
    fulness     0.17
    venile      0.16
    esterday    0.16
    hood        0.16
    entimes     0.16
    codegen     0.15
    /student    0.15
    blood       0.15
    Activation Density: 0.012%

    No Known Activations
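
Two of the negative-logit tokens, `ãĥ£` and `å§Ķ`, look like raw vocabulary strings from a byte-level BPE tokenizer, not garbled text. The sketch below assumes the model uses the GPT-2-style byte-to-character mapping, which this page does not confirm. Under that assumption it shows how such strings could be mapped back to readable text. The helper name `decode_token` is hypothetical.

```python
def bytes_to_unicode():
    """Build the GPT-2-style table mapping each byte (0-255) to a printable character."""
    # Bytes that are already printable map to themselves.
    bs = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(ord("¡"), ord("¬") + 1))
        + list(range(ord("®"), ord("ÿ") + 1))
    )
    cs = bs[:]
    n = 0
    # Every remaining byte is shifted to a code point at 256 and above.
    for b in range(256):
        if b not in bs:
            bs.append(b)
            cs.append(256 + n)
            n += 1
    return dict(zip(bs, map(chr, cs)))


def decode_token(token):
    """Hypothetical helper: turn a byte-level vocabulary string back into text.

    Assumes every character of `token` appears in the GPT-2 table above.
    Byte sequences that are not valid UTF-8 on their own decode to U+FFFD.
    """
    char_to_byte = {c: b for b, c in bytes_to_unicode().items()}
    raw = bytes(char_to_byte[ch] for ch in token)
    return raw.decode("utf-8", errors="replace")


if __name__ == "__main__":
    for tok in ["ãĥ£", "å§Ķ", "quake"]:
        print(repr(tok), "->", repr(decode_token(tok)))
```

If the assumption holds, `ãĥ£` decodes to the katakana `ャ` and `å§Ķ` to the character `委`, while plain ASCII tokens such as `quake` come back unchanged.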