INDEX
    Explanations

    mentions of small or young individuals, specifically when used as term of endearment or to indicate size

    the word "little" in various contexts

    New Auto-Interp
    Negative Logits
    chwitz
    -0.90
    anwhile
    -0.82
    igham
    -0.78
     conclud
    -0.77
    idential
    -0.75
    arbon
    -0.74
    cius
    -0.73
    orthy
    -0.73
    idents
    -0.72
    ï¸
    -0.71
    POSITIVE LOGITS
     bit
    0.98
     girl
    0.87
     kid
    0.85
     girls
    0.83
     boy
    0.82
     boys
    0.79
     helper
    0.76
     snippets
    0.75
     brother
    0.75
     sister
    0.75
    Act Density 0.026%

    No Known Activations