INDEX
Explanations
numerical statistics or percentages in the text
Negative Logits
LAN       -0.73
izont     -0.70
Broad     -0.69
aucus     -0.69
achu      -0.69
Activity  -0.68
redit     -0.68
Cra       -0.68
helps     -0.68
avior     -0.68

Positive Logits
thousand  1.14
percent   1.03
een       0.99
cents     0.94
teen      0.92
hundred   0.91
eenth     0.89
million   0.84
pounds    0.84
dollars   0.79
Activations Density: 0.049%