INDEX
Explanations
words related to deriving benefits or advantages
phrases indicating some form of benefit or advantage
New Auto-Interp
Negative Logits
aval
-0.59
potato
-0.59
faults
-0.59
cerning
-0.58
Traps
-0.58
amb
-0.56
OVA
-0.56
nucle
-0.55
garage
-0.54
smokes
-0.54
POSITIVE LOGITS
iciary
0.93
benefit
0.89
doms
0.88
ĸļ
0.83
emale
0.82
onies
0.80
financially
0.76
benefiting
0.74
Marginal
0.74
CHAT
0.73
Activations Density 0.033%