INDEX
Explanations
the number "one" appearing in different contexts
instances of the word "one" in various contexts
New Auto-Interp
Negative Logits
cases
-1.16
projects
-1.04
lations
-0.93
values
-0.93
types
-0.92
ooks
-0.92
forms
-0.90
groups
-0.89
levels
-0.89
olds
-0.88
POSITIVE LOGITS
heck
1.01
month
1.01
hour
0.97
year
0.95
week
0.92
hell
0.91
hundred
0.88
inning
0.88
minute
0.87
hurdle
0.86
Activations Density 0.088%