INDEX
Explanations
negative statements, specifically those with 'not' or 'did not'
phrases indicating negation or the absence of something
New Auto-Interp
Negative Logits
horizont
-0.68
proportions
-0.68
)=(
-0.66
Levels
-0.63
tons
-0.61
Isn
-0.61
you
-0.60
Gems
-0.60
ERG
-0.60
Tank
-0.59
POSITIVE LOGITS
officially
1.08
formally
1.01
explicitly
0.97
necessarily
0.97
publicly
0.95
definitively
0.93
confir
0.90
icably
0.90
exactly
0.82
disclose
0.82
Activations Density 0.243%