INDEX
    Explanations

    phrases related to the number "two"

    the repetition of the word "two."

    Negative Logits
    "ugu"        -0.82
    "asta"       -0.76
    "amaru"      -0.72
    " Caption"   -0.69
    "lus"        -0.68
    "ashtra"     -0.68
    "rir"        -0.67
    "ICLE"       -0.65
    "ULE"        -0.63
    "needs"      -0.63

    Positive Logits
    " thirds"    1.60
    " dozen"     1.13
    " hundred"   1.06
    " weeks"     1.04
    " halves"    1.02
    "fold"       0.99
    "teenth"     0.95
    " decades"   0.88
    "teen"       0.88
    "een"        0.88

    Activation Density: 0.129%

    No Known Activations