INDEX
    Explanations

    phrases emphasizing the concept of freedom, particularly in relation to trade, religion, and association

    New Auto-Interp
    Negative Logits
    ãĥ¼ãĤ¯
    -0.16
    sty
    -0.15
    illery
    -0.14
    ÄĻż
    -0.14
    alto
    -0.14
    eva
    -0.14
    że
    -0.14
    lady
    -0.14
    à¸Ńà¸ĩ
    -0.14
    łí
    -0.14
    POSITIVE LOGITS
    boro
    0.17
    elix
    0.16
     Avery
    0.15
    lick
    0.15
    bsub
    0.15
    .constructor
    0.15
    бÑĸ
    0.15
     Lit
    0.14
    hue
    0.14
     str
    0.14
    Act Density 0.011%

    No Known Activations