INDEX
    Explanations

    references to social justice and humane treatment issues

    New Auto-Interp
    Negative Logits
    @student
    -0.15
    CVE
    -0.14
    ÃĹ↵↵
    -0.14
    otify
    -0.14
     tý
    -0.14
    recated
    -0.14
    Writes
    -0.14
     Reads
    -0.14
    ICODE
    -0.14
    writes
    -0.14
    POSITIVE LOGITS
     to
    0.24
    :
    0.23
     '
    0.21
     ready
    0.20
     eyes
    0.20
     poised
    0.19
    0.19
     inches
    0.18
     bol
    0.18
     vs
    0.17
    Act Density 0.178%

    No Known Activations