INDEX
Explanations
the prefix "non-" in words
references to the term "non" in various contexts
New Auto-Interp
Negative Logits
Tycoon
-0.95
Orchestra
-0.79
Oaks
-0.77
Memories
-0.77
Spoon
-0.77
Sparrow
-0.76
Showdown
-0.76
Grind
-0.76
McGee
-0.76
Grill
-0.75
POSITIVE LOGITS
etheless
1.02
zero
1.01
chal
1.00
fiction
0.99
epad
0.99
aligned
0.96
basic
0.93
availability
0.93
resident
0.92
linear
0.91
Activations Density 0.021%