INDEX
    Explanations

    comparative terms indicating superiority or increased quantity

    New Auto-Interp
    Negative Logits
    Ire
    -0.69
    VICE
    -0.67
    LAB
    -0.64
     CTR
    -0.63
     largeDownload
    -0.62
     stead
    -0.61
     valve
    -0.60
    maid
    -0.59
     clearance
    -0.58
     centr
    -0.57
    POSITIVE LOGITS
    atos
    1.58
    assis
    1.43
    rax
    0.86
    osaurs
    0.85
    alan
    0.85
    ority
    0.85
    ormal
    0.83
    agar
    0.82
    atra
    0.82
    kees
    0.82
    Act Density 0.005%

    No Known Activations