INDEX
    Explanations

    phrases indicating causation or results

    New Auto-Interp
    Negative Logits
     '{@
    -0.67
     batalha
    -0.67
     entgegen
    -0.60
     escuchadas
    -0.60
     casó
    -0.60
     voraus
    -0.58
     strå
    -0.57
    styleable
    -0.57
     fluence
    -0.57
     enfans
    -0.55
    POSITIVE LOGITS
     resulted
    0.90
     caused
    0.80
     resulting
    0.80
    ToAction
    0.80
     increased
    0.79
     eventual
    0.78
    导致
    0.77
     causes
    0.76
     causing
    0.74
    caused
    0.72
    Act Density 0.347%

    No Known Activations