Explanations
adjectives related to importance or priority
keywords indicating the primary or dominant factors in various contexts
Negative Logits
zanne      -0.84
oops       -0.82
earances   -0.77
ournals    -0.69
rentices   -0.69
rosso      -0.67
ldom       -0.67
rahim      -0.66
Lauder     -0.66
Row        -0.66
Positive Logits
source       1.22
beneficiary  1.19
conduit      1.06
culprit      1.04
indicator    1.04
obstacle     1.01
predictor    0.99
reason       0.99
casualty     0.98
contributor  0.96
Activations Density: 0.130%