Explanations
phrases related to negative consequences or costs incurred
Negative Logits
abad       -0.75
ortium     -0.71
kan        -0.71
oub        -0.69
oggles     -0.67
packed     -0.66
Clover     -0.65
rete       -0.65
alde       -0.65
binding    -0.64
Positive Logits
detriment       0.90
expense         0.78
altar           0.76
thereof         0.76
othal           0.70
liest           0.69
inconvenient    0.65
ħĭ              0.64
disadvantage    0.63
taxpayers       0.63
Activations Density 0.013%
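
As a rough illustration of how a density figure like this can be read, the sketch below assumes density means the fraction of token positions on which the feature's activation is above zero. That definition, the function name `activation_density`, and the synthetic data are all assumptions made for illustration; the page itself does not say how the number was computed.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" is the share of token positions where the
    feature fires at all. The source page does not define it.
    """
    if not activations:
        raise ValueError("need at least one activation value")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Synthetic example: 13 active positions out of 100,000 tokens.
sample = [0.0] * 100_000
for i in range(13):
    sample[i * 7_000] = 1.5  # arbitrary nonzero activation value

density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.013%
```

Under that reading, a density of 0.013% corresponds to the feature firing on roughly 13 of every 100,000 tokens.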