INDEX
    Explanations

    questions about ethics and morality, especially related to societal and political circumstances

    New Auto-Interp
    Negative Logits
     uncin
    -0.76
     solidar
    -0.70
     notor
    -0.68
     platt
    -0.65
    Wię
    -0.65
     ideolog
    -0.65
     dises
    -0.64
     tomat
    -0.63
     robus
    -0.63
    Warto
    -0.62
    POSITIVE LOGITS
     $?
    0.76
     déploy
    0.73
     écout
    0.71
     prêtres
    0.70
     Juifs
    0.65
     chrétien
    0.64
     dédi
    0.64
    ?
    0.64
     lumineuse
    0.63
     yoksa
    0.63
    Act Density 0.165%

    No Known Activations