INDEX
Negative Logits
cross
-0.08
Cross
-0.08
causal
-0.07
_cross
-0.07
.Cross
-0.07
Cross
-0.07
operations
-0.07
cross
-0.07
-c
-0.07
NAND
-0.07
POSITIVE LOGITS
immigrants
0.17
immigration
0.16
immigrant
0.16
inmigr
0.15
immigr
0.13
Immigration
0.13
emigr
0.10
settlers
0.10
extranj
0.10
migrants
0.10
Activations Density 0.016%