INDEX
    Explanations

    elements related to conflict or competition

    New Auto-Interp
    Negative Logits
    	
    -0.17
    âĸ²
    -0.13
     
    -0.13
     ecstatic
    -0.12
    '
    -0.12
     indis
    -0.11
     emin
    -0.11
    -
    -0.11
     melod
    -0.11
     painfully
    -0.11
    POSITIVE LOGITS
     nackte
    0.18
     iParam
    0.14
    Âł↵↵
    0.14
     ìĹŃìĭľ
    0.14
    ’↵↵
    0.13
    /.↵↵
    0.13
     skoro
    0.13
     ?.
    0.13
    á¾
    0.13
    (non
    0.13
    Act Density 0.656%

    No Known Activations