INDEX
    Explanations

    academic paper sections

    New Auto-Interp
    Negative Logits
     devs
    0.43
     recou
    0.43
     parab
    0.42
     coax
    0.40
     Halloween
    0.40
     jot
    0.40
    linkCell
    0.39
    クーポン
    0.39
     squir
    0.38
    0.38
    POSITIVE LOGITS
     thesis
    1.73
     dissertation
    1.63
    Thesis
    1.63
     Thesis
    1.59
    thesis
    1.56
     Dissertation
    1.51
     tesis
    1.21
    Diss
    1.09
     Doctoral
    1.09
     doctoral
    1.08
    Act Density 0.014%

    No Known Activations