INDEX
Explanations
references to news sources or headlines
instances of the letter "U" in various contexts
New Auto-Interp
Negative Logits
Noir
-0.90
CPC
-0.78
CDs
-0.72
bottleneck
-0.69
Preferred
-0.66
offline
-0.63
Bach
-0.62
Solitaire
-0.62
Pioneer
-0.62
jams
-0.62
POSITIVE LOGITS
seless
1.22
prising
1.15
nexpected
1.14
PDATED
1.11
pperc
1.02
berman
1.02
ntil
1.01
raviolet
1.00
zbek
0.99
CLA
0.99
Activations Density 0.042%