Explanations
statistical significance in research results
Negative Logits
Token     Logit
Larger    -0.80
(>        -0.78
Larger    -0.77
الحره     -0.76
larger    -0.74
larger    -0.72
bigger    -0.71
|>        -0.69
Higher    -0.69
Bigger    -0.69
Positive Logits
Token     Logit
less      1.92
Less      1.61
Less      1.47
moins     1.45
menos     1.36
LESS      1.35
lesser    1.28
fewer     1.27
less      1.24
weniger   1.16
Activations Density: 1.132%