INDEX
    Explanations

    references to bans and regulations

    references to prohibitions or restrictions

    New Auto-Interp
    Negative Logits
     Generations
    -0.73
     IMAGES
    -0.70
     Zeit
    -0.64
    lycer
    -0.64
     Apostles
    -0.64
    rious
    -0.62
     Rhythm
    -0.61
     PROG
    -0.60
     Temper
    -0.59
     rendition
    -0.59
    POSITIVE LOGITS
    ishment
    1.22
    hammer
    1.08
    hee
    1.00
    tering
    0.96
    ishing
    0.94
    zai
    0.92
    nered
    0.88
    jo
    0.84
    eful
    0.84
    ish
    0.82
    Act Density 0.035%

    No Known Activations