INDEX
Explanations
references to bans and regulations
references to prohibitions or restrictions
New Auto-Interp
Negative Logits
Generations
-0.73
IMAGES
-0.70
Zeit
-0.64
lycer
-0.64
Apostles
-0.64
rious
-0.62
Rhythm
-0.61
PROG
-0.60
Temper
-0.59
rendition
-0.59
POSITIVE LOGITS
ishment
1.22
hammer
1.08
hee
1.00
tering
0.96
ishing
0.94
zai
0.92
nered
0.88
jo
0.84
eful
0.84
ish
0.82
Activations Density 0.035%