INDEX
    Explanations

    symbols and formatting used in academic citations and references

    New Auto-Interp
    Negative Logits
    rana
    -0.16
    acf
    -0.15
    ascus
    -0.14
    rac
    -0.14
    ooth
    -0.14
     zp
    -0.14
    /display
    -0.14
     bowed
    -0.13
    uner
    -0.13
    fal
    -0.13
    POSITIVE LOGITS
    196
    0.25
    195
    0.23
    197
    0.20
    198
    0.17
    186
    0.17
    ingo
    0.17
    187
    0.16
    .kill
    0.16
    188
    0.16
    ickey
    0.16
    Act Density 0.043%

    No Known Activations