INDEX
    Explanations

    references to examples and case studies related to social issues

    Negative Logits
    _iff
    -0.14
    はず
    -0.14
    isser
    -0.13
    stoup
    -0.12
    ought
    -0.12
    ęż
    -0.12
    íš
    -0.11
    strument
    -0.11
    ipher
    -0.11
    ibs
    -0.11
    Positive Logits
     example
    0.99
     examples
    0.93
    example
    0.81
    examples
    0.77
     Example
    0.76
     exemple
    0.75
     Examples
    0.74
    例
    0.74
     пример
    0.73
    -example
    0.72
    Act Density 0.477%

    No Known Activations