INDEX
    Explanations

    concepts related to community, care, and educational development

    New Auto-Interp
    Negative Logits
    -alist
    -0.18
    /fw
    -0.16
    ivet
    -0.16
    intColor
    -0.15
    riad
    -0.15
    że
    -0.14
    à¸Ńว
    -0.14
    åĢĻ
    -0.14
    dux
    -0.14
     BOT
    -0.14
    POSITIVE LOGITS
     Ã
    0.14
     Wander
    0.14
    lang
    0.14
    Č
    0.14
    aight
    0.14
    TM
    0.14
     bias
    0.14
    æ¹¾
    0.14
    seat
    0.13
    ur
    0.13
    Act Density 0.117%

    No Known Activations