Explanations
questions or statements about decision-making and evaluation
phrases indicating the validity or appropriateness of actions and decisions
Negative Logits
Token      Logit
çīĪ        -0.80
details    -0.75
mares      -0.74
cember     -0.72
ãĥ©ãĥ³     -0.70
stares     -0.68
ciating    -0.68
moil       -0.67
few        -0.66
letal      -0.65
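
Two of the tokens above (çīĪ and ãĥ©ãĥ³) look like the printable stand-ins that a byte-level BPE vocabulary uses for raw UTF-8 bytes. The sketch below assumes the model uses the GPT-2 byte-to-character mapping, which this page does not state; decode_token is a hypothetical helper name introduced only for illustration.

```python
def bytes_to_unicode():
    """GPT-2 style mapping from each byte value (0-255) to a printable character."""
    printable = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(ord("¡"), ord("¬") + 1))
        + list(range(ord("®"), ord("ÿ") + 1))
    )
    byte_values = printable[:]
    code_points = printable[:]
    offset = 0
    for b in range(256):
        if b not in printable:
            # Non-printable bytes are shifted to code points starting at 256.
            byte_values.append(b)
            code_points.append(256 + offset)
            offset += 1
    return dict(zip(byte_values, (chr(c) for c in code_points)))


def decode_token(token: str) -> str:
    """Hypothetical helper: turn a vocabulary string back into readable text.

    Assumes every character of `token` belongs to the GPT-2 byte alphabet.
    """
    char_to_byte = {ch: b for b, ch in bytes_to_unicode().items()}
    raw = bytes(char_to_byte[ch] for ch in token)
    return raw.decode("utf-8", errors="replace")


print(decode_token("ãĥ©ãĥ³"))   # ラン
print(decode_token("çīĪ"))      # 版
print(decode_token("details"))  # details
```

Under that assumption the two opaque entries are ordinary Japanese and Chinese subword tokens, not extraction damage. Plain ASCII tokens pass through unchanged.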
Positive Logits
Token        Logit
worthwhile   1.54
feasible     1.38
acceptable   1.34
warranted    1.32
worth        1.30
appropriate  1.28
adequate     1.28
necessary    1.28
meaningful   1.25
accurate     1.25
Activations Density 0.229%
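
The page does not define how the density figure is computed. A common reading is the fraction of sampled tokens on which the feature is active, and the sketch below assumes that definition. The function name, the threshold parameter, and the sample values are all hypothetical and are not taken from this feature's data.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of tokens whose activation is strictly above `threshold`.

    Assumed definition; the dashboard may compute density differently.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 1000 tokens, with the feature firing on 2 of them.
sample = [0.0] * 998 + [1.7, 0.4]
print(f"{activation_density(sample):.3%}")  # 0.200%
```

Read this way, a density of 0.229% would mean the feature fires on roughly 2 to 3 tokens out of every thousand.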