INDEX
    Explanations

    concepts related to individual freedom and autonomy

    Negative Logits
    Token            Logit
    "cplusplus"      -0.15
    "rique"          -0.15
    "不足"           -0.15
    "wap"            -0.14
    "imary"          -0.14
    "umlu"           -0.13
    ".Peek"          -0.13
    "ैठ"             -0.13
    "inecraft"       -0.13
    " Exclusive"     -0.13
    Positive Logits
    Token              Logit
    " freedom"         0.82
    " liberty"         0.69
    " Freedom"         0.66
    " freedoms"        0.65
    "Freedom"          0.62
    "自由"             0.58
    "fre"              0.57
    " independence"    0.57
    " своб"            0.54
    " libert"          0.52
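
A minimal sketch of how token/logit lists like the ones above are commonly produced, assuming they come from projecting the feature's decoder direction through the model's unembedding weights. The source does not confirm this method. The function names, the two-dimensional vectors, and the four-token vocabulary are all hypothetical and chosen only for illustration.

```python
# Toy sketch: rank vocabulary tokens by how much a feature's decoder
# direction raises or lowers their logits. All numbers are hypothetical.

def logit_effects(decoder_direction, unembedding, vocab):
    """Pair each token with the dot product of the decoder direction
    and that token's unembedding vector."""
    effects = []
    for token, weights in zip(vocab, unembedding):
        effect = sum(d * w for d, w in zip(decoder_direction, weights))
        effects.append((token, effect))
    return effects


def top_and_bottom(effects, k):
    """Return the k most positive and the k most negative tokens."""
    ranked = sorted(effects, key=lambda pair: pair[1], reverse=True)
    positive = ranked[:k]
    negative = ranked[-k:][::-1]  # most negative first
    return positive, negative


# Hypothetical vocabulary and weights (one vector per token).
vocab = [" freedom", " liberty", "wap", " Exclusive"]
unembedding = [[0.8, 0.1], [0.7, -0.2], [-0.1, 0.3], [-0.2, 0.5]]
decoder_direction = [1.0, 0.0]

effects = logit_effects(decoder_direction, unembedding, vocab)
positive, negative = top_and_bottom(effects, k=2)
print("positive:", [token for token, _ in positive])
print("negative:", [token for token, _ in negative])
```

Leading spaces are kept inside the token strings because " Freedom" and "Freedom" are distinct vocabulary entries.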
    Act Density 0.402%
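
A minimal sketch of one way to read the density figure, assuming "Act Density" means the percentage of sampled tokens on which the feature's activation is above zero. The function name, the threshold parameter, and the sample data are hypothetical and are not taken from the source.

```python
def activation_density(activations, threshold=0.0):
    """Percentage of tokens whose activation exceeds the threshold."""
    if not activations:
        raise ValueError("activations must not be empty")
    active = sum(1 for value in activations if value > threshold)
    return 100.0 * active / len(activations)


# Hypothetical sample: 1,000 tokens, 4 of which activate the feature.
sample = [0.0] * 996 + [1.2, 0.4, 2.5, 0.9]
density = activation_density(sample)
print(f"Act Density {density:.3f}%")  # prints "Act Density 0.400%"
```

Under that reading, a density of 0.402% corresponds to roughly 4 active tokens per 1,000 sampled.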

    No Known Activations