INDEX
    Explanations

    phrases indicating a comparison of increasing quantities or intensities

    the repetition of the word "and" indicating a continued list or a buildup of ideas

    New Auto-Interp
    Negative Logits
    aiden
    -0.64
    afety
    -0.63
    digy
    -0.63
    hoe
    -0.58
    aturday
    -0.58
    ONSORED
    -0.57
    dden
    -0.57
    edom
    -0.55
    oola
    -0.55
    ixie
    -0.54
    POSITIVE LOGITS
    rogens
    0.90
     farther
    0.84
     more
    0.82
     clearer
    0.81
     better
    0.81
    rogen
    0.81
     louder
    0.78
     stronger
    0.75
     faster
    0.74
    more
    0.73
    Act Density 0.027%

    No Known Activations