INDEX
Explanations
words related to causes and effects
Negative Logits
Tatsache          -0.84
קישורים           -0.84
للمعارف           -0.74
mellitus          -0.70
Adri              -0.70
Betts             -0.69
Ema               -0.68
paddingVertical   -0.67
Fl                -0.66
Haller            -0.66
Positive Logits
CAUSE             1.48
cause             1.42
Causes            1.41
Cause             1.40
Caus              1.39
causes            1.35
causes            1.34
caused            1.30
caused            1.28
cause             1.27
Activations Density 0.103%