Explanations
negations in sentences
negations or expressions that indicate the absence of something
Negative Logits
Token       Logit
itiz        -0.77
Laun        -0.70
PDATE       -0.69
Gry         -0.68
lined       -0.66
Citiz       -0.63
Seasons     -0.63
Creative    -0.63
çĶŁ         -0.62
IGN         -0.62
Positive Logits
Token           Logit
necessarily     1.24
intend          1.13
deserve         1.12
belong          1.11
exist           1.09
bother          1.09
seem            1.09
condone         1.07
know            1.07
discriminate    1.01
Activations Density 0.108%
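
To give a feel for the density figure, here is a minimal sketch that converts it into "one activation per N tokens". It assumes that density means the fraction of sampled tokens on which the feature has a nonzero activation; the source does not define the term, and the function name tokens_per_activation is hypothetical, chosen only for this illustration.

```python
def tokens_per_activation(density_percent: float) -> float:
    """Average number of tokens per activation for a density given in percent.

    Assumption: density is the share of sampled tokens on which the
    feature fires at all (nonzero activation).
    """
    if density_percent <= 0:
        raise ValueError("density must be positive")
    return 100.0 / density_percent


density_percent = 0.108  # value reported on the dashboard
print(round(tokens_per_activation(density_percent)))  # prints 926
```

Under that assumption, the feature fires on roughly one token in every 926, which fits a feature tied to a specific construction such as negation.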