INDEX
    Explanations

    instances where the phrase "Better" is mentioned within the text

    references to the concept of "better" or improvements in various contexts

    Negative Logits
    "trl"         -0.74
    "eur"         -0.73
    "ulating"     -0.73
    "hips"        -0.72
    "ettes"       -0.71
    "eton"        -0.70
    "heter"       -0.69
    "encers"      -0.69
    "wart"        -0.69
    "entially"    -0.68
    Positive Logits
    " Than"       1.02
    " Better"     0.96
    " Faster"     0.95
    "Better"      0.83
    " than"       0.77
    "idge"        0.76
    "lihood"      0.76
    "luaj"        0.74
    " benches"    0.72
    "than"        0.69
    Act Density 0.014%

    No Known Activations