INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    infeld
    -0.75
    bos
    -0.65
    enne
    -0.64
    cks
    -0.64
    Cra
    -0.63
     Chow
    -0.63
    roth
    -0.62
    IDA
    -0.62
    info
    -0.61
    inen
    -0.61
    POSITIVE LOGITS
    ousand
    1.15
     inning
    1.13
    teenth
    1.08
    irty
    1.07
     percentile
    0.98
    ousands
    0.97
     century
    0.95
     amendment
    0.94
     Amendment
    0.91
    ieth
    0.89
    Act Density 0.049%

    No Known Activations