INDEX
Explanations
phrases indicating quantity or duration of time
phrases indicating quantities, particularly multiples of two
New Auto-Interp
Negative Logits
hess
-0.72
bryce
-0.71
Bomber
-0.67
Reviewer
-0.65
↵Âł
-0.64
Cole
-0.63
Clar
-0.63
Havana
-0.63
Cassidy
-0.62
TC
-0.62
POSITIVE LOGITS
hundred
1.09
thousand
1.09
dozen
1.07
million
0.89
thirds
0.88
dozen
0.87
billion
0.78
ousands
0.78
inches
0.74
fold
0.73
Activations Density 0.132%