INDEX
    Explanations

    instances of the word "simple" and phrases that describe simplicity

    simple and uncomplicated

    New Auto-Interp
    Negative Logits
    IBarButtonItem
    -0.80
     passim
    -0.79
     AssemblyCompany
    -0.66
    InputBorder
    -0.65
    @@@@@@@@
    -0.65
    timbangkan
    -0.65
    edero
    -0.64
     turquesa
    -0.62
    hyrchwyd
    -0.61
    MLLoader
    -0.61
    POSITIVE LOGITS
     SIMPLE
    1.09
    SIMPLE
    1.00
     simple
    0.99
     Simple
    0.97
    Simple
    0.91
     Plain
    0.89
     uncomplicated
    0.87
     simpl
    0.83
    simple
    0.82
     Simplicity
    0.82
    Act Density 0.091%

    No Known Activations