INDEX
Explanations
frequency adverbs that indicate commonality or emphasis
New Auto-Interp
Negative Logits
transvers
-0.48
AutoModerator
-0.48
Référence
-0.46
Stap
-0.45
installa
-0.43
États
-0.43
DrawerToggle
-0.43
Negara
-0.42
Yemen
-0.42
FBref
-0.42
POSITIVE LOGITS
mainly
0.92
Mostly
0.89
mainly
0.88
primarily
0.87
Mainly
0.87
Mostly
0.86
mostly
0.84
mostly
0.84
głównie
0.81
Primarily
0.80
Activations Density 0.303%