INDEX
    Explanations

    words related to legal and political contexts, especially related to individuals and their actions

    New Auto-Interp
    Negative Logits
    Interstitial
    -0.70
    ¥µ
    -0.67
    TERN
    -0.64
    âĸ¬âĸ¬
    -0.63
    animate
    -0.63
    frey
    -0.62
    stration
    -0.60
    ãĥ¬
    -0.60
    ORGE
    -0.59
    cemic
    -0.59
    POSITIVE LOGITS
    ernel
    0.88
    ed
    0.85
    irts
    0.82
    atchewan
    0.79
    ozy
    0.78
    edIn
    0.78
    mallow
    0.76
    er
    0.76
    itudinal
    0.75
    hoff
    0.71
    Act Density 5.366%

    No Known Activations