INDEX
Explanations
words related to prevention or obstruction of actions or processes
phrases related to prevention or blockage of actions or conditions
New Auto-Interp
Negative Logits
çīĪ
-0.73
eria
-0.70
clair
-0.69
aback
-0.65
summar
-0.65
xtap
-0.65
fortunately
-0.63
hooting
-0.63
awaits
-0.63
etheless
-0.62
POSITIVE LOGITS
anymore
0.95
BuyableInstoreAndOnline
0.87
altogether
0.84
anything
0.82
harmful
0.74
any
0.73
Expend
0.67
violent
0.66
toxic
0.65
undue
0.65
Activations Density 0.153%