INDEX
    Explanations

    phrases or words related to the concept of "freedom"

    Negative Logits
    <bos>         -1.37
    /*            -0.89
    /**           -0.80
    public        -0.75
    /*            -0.74
    (whitespace)  -0.73
    //            -0.71
    #             -0.70
    ,             -0.68
    .             -0.67
    Positive Logits
     freedom      2.10
     Freedom      2.10
     FREEDOM      2.07
    freedom       2.00
    Freedom       1.96
     Minang       1.91
     affor        1.88
     accla        1.88
     stockholm    1.88
     bandung      1.84
    Act Density 0.120%

    No Known Activations