INDEX
Explanations
phrases emphasizing inclusion or universality
repetitive conjunctions indicating inclusivity or expansiveness
New Auto-Interp
Negative Logits
Corpus
-0.60
Authors
-0.59
Beg
-0.58
pec
-0.56
Pick
-0.56
pi
-0.53
Crunch
-0.53
Wire
-0.52
detectors
-0.52
Mess
-0.52
POSITIVE LOGITS
rogen
1.09
sund
0.93
rogens
0.91
ro
0.91
every
0.87
EVERY
0.84
everyone
0.80
every
0.80
ãĥ¼ãĥĨ
0.79
everything
0.77
Activations Density 0.049%