INDEX
    Explanations

    numerical citation formats within academic references

    New Auto-Interp
    Negative Logits
    ger
    -0.15
    arging
    -0.15
    iling
    -0.14
    oram
    -0.14
     terminal
    -0.14
    ply
    -0.14
     Terminal
    -0.14
    sher
    -0.14
    å©
    -0.14
    elyn
    -0.14
    POSITIVE LOGITS
    iggins
    0.17
     zoek
    0.16
    abwe
    0.14
    ncmp
    0.14
    Ñĩно
    0.13
    opa
    0.13
    ourke
    0.13
    ajan
    0.13
     ult
    0.13
     FAC
    0.13
    Act Density 0.021%

    No Known Activations