INDEX
    Explanations

    immigration

    New Auto-Interp
    Negative Logits
     cross
    -0.08
     Cross
    -0.08
     causal
    -0.07
    _cross
    -0.07
    .Cross
    -0.07
    Cross
    -0.07
     operations
    -0.07
    cross
    -0.07
    -c
    -0.07
     NAND
    -0.07
    POSITIVE LOGITS
     immigrants
    0.17
     immigration
    0.16
     immigrant
    0.16
     inmigr
    0.15
     immigr
    0.13
     Immigration
    0.13
     emigr
    0.10
     settlers
    0.10
     extranj
    0.10
     migrants
    0.10
    Act Density 0.016%

    No Known Activations