INDEX
Explanations
instances of the number "one" followed by another word
references to singularity or the concept of "one"
Negative Logits
ooks       -1.02
projects   -0.95
bits       -0.93
estones    -0.91
markets    -0.90
ourses     -0.90
nesses     -0.90
models     -0.90
scenes     -0.90
ships      -0.89
Positive Logits
hundred    1.09
person     1.09
instance   1.04
exception  0.96
thing      0.96
dozen      0.96
occupant   0.93
thousand   0.90
ounce      0.88
aspect     0.87
Activations Density 0.092%