INDEX
Explanations
the negation of actions or qualities
negations or phrases expressing denial
New Auto-Interp
Negative Logits
stakes
-0.61
Eighth
-0.59
Frenzy
-0.59
Platform
-0.58
Quarterly
-0.58
Rounds
-0.58
Stead
-0.57
Crisis
-0.56
Techn
-0.56
Circuit
-0.55
POSITIVE LOGITS
hin
1.35
epad
1.33
ifying
1.22
ifies
1.13
icably
1.07
only
1.07
icing
1.05
ched
1.05
ices
1.04
ifier
1.03
Activations Density 0.081%