Explanations
phrases indicating the attainment of benefits, rewards, or positive outcomes
occurrences of the article "a"
Negative Logits
anism       -0.92
endeavors   -0.74
Edit        -0.74
CI          -0.73
Measures    -0.71
ie          -0.69
agree       -0.68
aneously    -0.68
Area        -0.68
livious     -0.66
Positive Logits
lot         1.36
bunch       1.12
glimpse     1.07
handful     1.02
plethora    1.02
few         0.98
huge        0.98
couple      0.92
sizeable    0.92
whopping    0.92
Activations Density 0.330%