INDEX
    Explanations

    numeric values in the form of quantities

    occurrences of the word "two" and numbers indicating quantities or counts

    New Auto-Interp
    Negative Logits
    LER
    -0.70
    eah
    -0.67
    UGE
    -0.61
    indust
    -0.61
    asta
    -0.60
     ours
    -0.59
    grim
    -0.59
     Krish
    -0.58
    adish
    -0.58
    ampunk
    -0.58
    POSITIVE LOGITS
    peat
    1.14
     consecutive
    1.08
    teenth
    1.01
    eenth
    1.01
     apiece
    0.96
     successive
    0.92
     assists
    0.90
     interceptions
    0.89
    aciously
    0.86
    straight
    0.83
    Act Density 0.175%

    No Known Activations