INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    ingen
    -0.78
    md
    -0.70
    agree
    -0.70
    Ø©
    -0.69
    ledge
    -0.68
    olas
    -0.68
    iths
    -0.68
    tics
    -0.67
    behind
    -0.67
    casters
    -0.66
    POSITIVE LOGITS
     installment
    1.12
     few
    1.06
     couple
    1.03
     iteration
    1.00
     decade
    0.97
     batch
    0.95
     incarnation
    0.92
     edition
    0.92
     phase
    0.88
     portion
    0.86
    Act Density 0.548%

    No Known Activations