INDEX
    Explanations

    phrases related to replacement or substitution with something new

    phrases related to replacing old things with new alternatives

    New Auto-Interp
    Negative Logits
    awar
    -0.71
    ibrary
    -0.65
    Plot
    -0.62
    TOR
    -0.61
    ipl
    -0.60
    verbal
    -0.60
    warn
    -0.59
    banks
    -0.57
    Words
    -0.57
    tip
    -0.56
    POSITIVE LOGITS
     newer
    1.37
     new
    1.19
     simpler
    1.06
     softer
    1.04
     cleaner
    1.02
     healthier
    0.98
     safer
    0.97
     nicer
    0.93
     brighter
    0.91
     modern
    0.90
    Act Density 0.325%

    No Known Activations