INDEX
    Explanations

    expressions of regret or remorse

    New Auto-Interp
    Negative Logits
    House
    -0.47
     baum
    -0.46
    WC
    -0.46
     House
    -0.45
    dataSource
    -0.45
    Wu
    -0.45
    slidesToShow
    -0.43
    QS
    -0.43
     Helios
    -0.43
     Supply
    -0.42
    POSITIVE LOGITS
     regret
    1.08
     Regret
    1.02
    Regret
    1.00
     regretted
    0.93
     regrets
    0.90
     regrettable
    0.75
    RegressionTest
    0.68
    後悔
    0.67
     menyes
    0.65
    后悔
    0.64
    Act Density 0.003%

    No Known Activations