INDEX
    Explanations

    mentions of the concept of freedom, especially freedom of expression

    terms related to freedom, particularly freedom of expression and speech

    New Auto-Interp
    Negative Logits
    eor
    -0.76
    ded
    -0.67
    mers
    -0.65
    nos
    -0.64
    DERR
    -0.63
    itated
    -0.63
    ented
    -0.62
    acan
    -0.62
    ents
    -0.62
     Dynasty
    -0.62
    POSITIVE LOGITS
     Fighters
    0.91
    bies
    0.88
    fighters
    0.82
     roam
    0.79
    fighter
    0.78
     freedoms
    0.78
     fighters
    0.78
    Reviewer
    0.77
     guaranteed
    0.74
    boot
    0.74
    Act Density 0.034%

    No Known Activations