INDEX
    Explanations

    references to political parties and their dynamics

    New Auto-Interp
    Negative Logits
    <bos>
    -0.53
    "}")
    -0.48
     cewek
    -0.48
    "]))
    -0.46
     "))
    -0.43
    ]))
    -0.43
    ())))
    -0.43
    ']))
    -0.43
     '))
    -0.42
    })
    
    -0.41
    POSITIVE LOGITS
    Party
    1.15
     Party
    1.13
     PARTY
    1.12
    PARTY
    1.06
    party
    1.06
     party
    1.02
     Parties
    0.90
    Parties
    0.86
     partij
    0.79
     Particle
    0.78
    Act Density 0.021%

    No Known Activations