INDEX
    Explanations

    targeted content for younger audiences and children in various contexts

    Negative Logits
    ellig
    -0.15
    rlen
    -0.15
    batch
    -0.14
     Nose
    -0.14
     Discipline
    -0.14
    ichen
    -0.14
    ichern
    -0.14
     batch
    -0.14
     Cho
    -0.14
     Batch
    -0.14
    Positive Logits
     ages
    0.20
    راه
    0.18
     age
    0.17
    ages
    0.17
    年龄
    0.17
    ener
    0.15
    ovo
    0.15
    阅
    0.14
     Age
    0.14
     PHYS
    0.14
    Act Density 0.074%

    No Known Activations