INDEX
Explanations
words related to determination or decision-making
references to determining factors or criteria in various contexts
New Auto-Interp
Negative Logits
ĸļ
-0.88
athi
-0.79
oil
-0.79
nor
-0.77
angs
-0.75
blog
-0.72
BuyableInstoreAndOnline
-0.71
-0.71
onda
-0.70
mong
-0.70
POSITIVE LOGITS
outcome
1.08
optimal
1.02
whereabouts
0.99
direction
0.97
optimum
0.96
precise
0.94
effectiveness
0.93
appropriate
0.93
exact
0.91
accuracy
0.91
Activations Density 0.262%