INDEX
Explanations
numerical values tagged with a specific unit symbol
specific numerical values, particularly those related to monetary amounts
New Auto-Interp
Negative Logits
GOODMAN
-0.84
yang
-0.80
eering
-0.76
ezvous
-0.75
endum
-0.71
gments
-0.71
hran
-0.70
orate
-0.69
ongyang
-0.69
chard
-0.68
POSITIVE LOGITS
ILCS
1.35
75
0.99
475
0.89
80
0.87
655
0.86
8000
0.85
875
0.85
680
0.85
ength
0.84
00000
0.84
Activations Density 0.025%