INDEX
Explanations
phrases related to causes and effects
New Auto-Interp
Negative Logits
Tatsache
-0.80
ppins
-0.78
للمعارف
-0.76
mellitus
-0.75
##
-0.74
קישורים
-0.73
Przypisy
-0.72
Betts
-0.71
ersham
-0.71
Silverman
-0.71
POSITIVE LOGITS
cause
2.12
Cause
2.06
CAUSE
2.02
cause
1.92
Cause
1.90
causes
1.89
Causes
1.89
causes
1.82
CAUSE
1.76
Causes
1.65
Activations Density 0.099%