INDEX
Explanations
terms related to making decisions or calculations based on certain criteria or data
phrases that refer to assessments or conclusions based on prior information or assumptions
New Auto-Interp
Negative Logits
zz
-0.61
é¾įå
-0.60
qv
-0.58
osen
-0.58
ese
-0.56
vision
-0.55
TD
-0.55
Kro
-0.55
ESE
-0.55
slit
-0.55
POSITIVE LOGITS
upon
1.24
solely
1.02
on
0.95
loosely
0.89
purely
0.83
upon
0.83
off
0.83
squarely
0.79
uate
0.78
primarily
0.76
Activations Density 0.038%