INDEX
    Explanations

    words related to cause and effect

    references to the concept of "effect."

    New Auto-Interp
    Negative Logits
    idel
    -0.78
    oway
    -0.65
     SATA
    -0.63
    lehem
    -0.61
    bledon
    -0.60
    gio
    -0.59
    dar
    -0.59
    itars
    -0.59
    ritz
    -0.58
    nia
    -0.58
    POSITIVE LOGITS
     effect
    3.83
     Effect
    2.67
     effects
    2.40
    effect
    2.34
    Effect
    2.18
     Effects
    1.97
    effects
    1.85
     impact
    1.84
    Effects
    1.74
     affect
    1.56
    Act Density 0.015%

    No Known Activations