INDEX
Explanations
phrases related to sources of information or authority
phrases that convey reliance on evidence or data
Negative Logits
Token       Logit
icably      -0.71
ilyn        -0.70
assies      -0.70
"]=>        -0.69
ovember     -0.68
icer        -0.67
anke        -0.67
mos         -0.66
etsk        -0.66
ipeg        -0.66
Positive Logits
Token         Logit
assumption    1.08
premise       0.98
principles    0.96
principle     0.92
criteria      0.87
whims         0.87
assumptions   0.86
methodology   0.83
standpoint    0.80
hypothesis    0.78
Activations Density: 0.313%