INDEX
    Explanations

    phrases related to causation

    the word "thereby" and similar phrases indicating causation or consequence

    New Auto-Interp
    Negative Logits
     Serge
    -0.64
     Atkinson
    -0.64
     Freddie
    -0.62
    cer
    -0.61
     Kle
    -0.61
    ten
    -0.60
     Kelvin
    -0.60
     Columb
    -0.60
     Food
    -0.60
     Pepper
    -0.60
    POSITIVE LOGITS
     guiActiveUn
    0.99
    forth
    0.94
    hiba
    0.84
    ãĤ´ãĥ³
    0.80
     convol
    0.80
     guiActive
    0.77
     forfe
    0.75
     sidx
    0.75
     dwind
    0.73
     forfeit
    0.73
    Act Density 0.007%

    No Known Activations