INDEX
    Explanations

    keywords related to societal structures and health care topics

    New Auto-Interp
    Negative Logits
    .
    -0.71
    .
    
    -0.64
    ;
    -0.63
    -0.59
    .\\
    -0.53
    $.
    -0.53
    ().
    -0.53
    ".
    -0.53
    *.
    -0.51
    !.
    -0.50
    POSITIVE LOGITS
    들은
    0.99
    서는
    0.95
     itſelf
    0.94
    AndEndTag
    0.92
    BeginContext
    0.91
    리는
    0.91
    etheless
    0.91
     ***!
    0.90
    지는
    0.86
     myſelf
    0.86
    Act Density 1.244%

    No Known Activations