INDEX
Explanations
passive verb phrases
negative or absent states and contrasts
New Auto-Interp
Negative Logits
Ur
-0.78
ONSORED
-0.73
FTWARE
-0.73
EMBER
-0.72
KEN
-0.70
UTF
-0.68
2020
-0.67
ARR
-0.67
IDENT
-0.65
>:
-0.64
POSITIVE LOGITS
prolifer
1.06
everywhere
0.86
routinely
0.84
disproportionately
0.84
inherently
0.83
notoriously
0.82
typically
0.81
traditionally
0.80
generally
0.79
clustered
0.78
Activations Density 0.622%