INDEX
    Explanations

    numerical references and citations in academic text

    New Auto-Interp
    Negative Logits
    itele
    -0.17
    สะ
    -0.16
    ieber
    -0.16
    ieves
    -0.15
    aten
    -0.15
    kyt
    -0.15
    ANJI
    -0.14
    echn
    -0.14
    ichert
    -0.14
    bris
    -0.14
    POSITIVE LOGITS
     Synthetic
    0.16
    IPC
    0.16
     Cooler
    0.15
     synthetic
    0.15
     flu
    0.14
    anus
    0.14
     Cool
    0.14
    okit
    0.13
     cool
    0.13
     Ple
    0.13
    Act Density 0.004%

    No Known Activations