Explanations
the word "downtown" in various contexts
references to the concepts of "up" and "down" as well as their variations
Negative Logits
BuyableInstoreAndOnline  -0.85
Mori                     -0.79
UGC                      -0.78
Canaver                  -0.77
Http                     -0.75
Franch                   -0.75
REDACTED                 -0.74
Closure                  -0.73
Frag                     -0.73
Spur                     -0.73
Positive Logits
ownt    1.05
downt   1.01
upt     0.98
eenth   0.94
imes    0.92
urned   0.91
upt     0.88
riter   0.87
itled   0.86
ivery   0.85
Activations Density: 0.009%