INDEX
Explanations
numbers with the "+" operation applied, as well as collections or groups indicated by the term "plus"
phrases indicating quantities or numerical values
Negative Logits
rog            -0.77
Bates          -0.71
ked            -0.66
atography      -0.65
rium           -0.65
bird           -0.65
Hamm           -0.64
Lar            -0.64
YN             -0.64
rius           -0.63
Positive Logits
ottest         0.77
percentile     0.69
IMAGES         0.69
verage         0.68
intersections  0.67
overload       0.67
mph            0.66
servings       0.65
ILCS           0.65
consecutive    0.65
Activations Density 0.033%