INDEX
    Explanations

    words and phrases related to conflict or struggle

    New Auto-Interp
    Negative Logits
    agg
    -0.16
    alty
    -0.16
    ities
    -0.16
    heit
    -0.15
    upon
    -0.15
    appen
    -0.15
    icit
    -0.14
    aja
    -0.14
    cia
    -0.14
    ately
    -0.14
    POSITIVE LOGITS
     tooth
    0.21
    back
    0.21
     against
    0.18
    club
    0.17
    à¸Ĺาà¸Ļ
    0.17
    çīĻ
    0.16
    inh
    0.16
     Against
    0.16
     Tooth
    0.15
    ning
    0.15
    Act Density 0.029%

    No Known Activations