INDEX
    Explanations

    various forms of attribution and publication references in text

    New Auto-Interp
    Negative Logits
    _FC
    -0.16
     CPF
    -0.14
    _tf
    -0.14
    toi
    -0.14
    InOut
    -0.14
    bios
    -0.14
    گاÙĨ
    -0.14
    seau
    -0.13
     Andre
    -0.13
    mlink
    -0.13
    POSITIVE LOGITS
    .lib
    0.17
    oom
    0.16
    pun
    0.15
    aket
    0.15
    ox
    0.15
     pun
    0.15
    reeNode
    0.15
    sik
    0.15
    agy
    0.15
     Pun
    0.15
    Act Density 0.284%

    No Known Activations