INDEX
    Explanations

    rivalries and conflicts

    New Auto-Interp
    Negative Logits
     IE
    -0.09
     TVs
    -0.08
    设备
    -0.08
     ::↵
    -0.08
     أجهزة
    -0.08
     lagu
    -0.08
     YAML
    -0.08
    設備
    -0.08
     ETH
    -0.08
     RGB
    -0.07
    POSITIVE LOGITS
     resentment
    0.11
     revenge
    0.11
     rivalry
    0.11
     distrust
    0.10
     hatred
    0.10
     antagon
    0.10
     betrayal
    0.10
    previous
    0.10
     disdain
    0.10
     grievances
    0.09
    Act Density 0.094%

    No Known Activations