    Explanations

    words related to uniformity or things being the same across different instances

    instances of the word "uniform" and its variations

    Negative Logits
    Token                  Logit
    "heimer"               -0.92
    "UD"                   -0.84
    "×Ļ×"                  -0.80
    "=-=-=-=-"             -0.79
    "=-=-=-=-=-=-=-=-"     -0.74
    "à"                    -0.74
    "slow"                 -0.73
    "ï¸"                   -0.73
    "udic"                 -0.72
    "udder"                -0.72
    Positive Logits
    Token                  Logit
    " uniform"             1.05
    " uniforms"            1.02
    "iating"               1.00
    "ially"                0.92
    "itarian"              0.92
    "ity"                  0.92
    "iated"                0.90
    " insign"              0.87
    " attire"              0.86
    " worn"                0.85
    Activation Density: 0.009%

    No Known Activations