Explanations
concepts related to factors and considerations affecting outcomes
Negative Logits
Token       Logit
include     -0.17
Include     -0.17
matching    -0.16
Matching    -0.15
matched     -0.15
aminer      -0.14
Everyone    -0.14
585         -0.14
dit         -0.13
contain     -0.13
Positive Logits
Token         Logit
combine       0.34
together      0.30
combination   0.28
combine       0.28
cons          0.27
cul           0.26
contrib       0.25
contrib       0.24
combination   0.24
add           0.24
Activations Density: 0.205%