INDEX
    Explanations

    phrases indicating causation or potential consequences

    occurrences of the word "lead" in various contexts

    Negative Logits
    "Mach"       -0.76
    "apy"        -0.73
    "eatures"    -0.66
    "ongyang"    -0.65
    "emis"       -0.63
    "arma"       -0.63
    "cube"       -0.63
    "phis"       -0.63
    "orrow"      -0.62
    "ategor"     -0.62
    Positive Logits
    "lead"        1.05
    " lead"       0.90
    "better"      0.88
    " Lead"       0.85
    " Leading"    0.81
    "Lead"        0.81
    " leads"      0.75
    " leading"    0.71
    "boards"      0.70
    "ership"      0.70
    Activation Density: 0.019%

    No Known Activations