INDEX
    Explanations

    concepts related to freedom, particularly in political and economic contexts

    New Auto-Interp
    Negative Logits
    arians
    -0.16
    lou
    -0.15
    riad
    -0.15
    .LENGTH
    -0.15
    py
    -0.15
    elastic
    -0.14
    stem
    -0.14
    ous
    -0.14
    lio
    -0.14
    Ñģп
    -0.14
    POSITIVE LOGITS
    bies
    0.29
    bie
    0.29
    bsd
    0.28
    -standing
    0.24
    zing
    0.23
    zes
    0.23
    -floating
    0.23
    /free
    0.22
    zers
    0.22
    zer
    0.21
    Act Density 0.049%

    No Known Activations