INDEX
    Explanations

    phrases related to improvement or optimization

    mentions of the term "Better."

    New Auto-Interp
    Negative Logits
    eur
    -0.70
    orial
    -0.69
    verson
    -0.68
    wart
    -0.67
    trl
    -0.67
    htar
    -0.66
    ãĤ·ãĥ£
    -0.66
    ãĥĢ
    -0.65
    LS
    -0.64
    MIT
    -0.63
    POSITIVE LOGITS
     Better
    1.23
    Better
    1.00
     Faster
    0.88
     Than
    0.85
     Advice
    0.80
     Quality
    0.75
     Neigh
    0.74
     Worse
    0.74
     Rivals
    0.74
    luaj
    0.73
    Act Density 0.008%

    No Known Activations