INDEX
    Explanations

    words related to politeness and etiquette

    terms related to politeness and etiquette

    Negative Logits
    ointed    -0.81
    ARK       -0.80
    runs      -0.75
    pher      -0.73
    razil     -0.73
    arks      -0.72
    assets    -0.70
    yrs       -0.69
    alone     -0.69
    slave     -0.68
    Positive Logits
     polite       1.17
     etiquette    0.98
    iquette       0.94
     polit        0.83
     manners      0.81
     banter       0.80
     bourgeois    0.79
     intellig     0.76
     decency      0.75
     politely     0.75
    Activation Density: 0.009%

    No Known Activations