INDEX
    Explanations

    phrases related to accidents or mishaps

    New Auto-Interp
    Negative Logits
    IMATE
    -0.51
    HEET
    -0.49
    antasy
    -0.49
    lored
    -0.48
    抱着
    -0.47
    mayın
    -0.47
    érica
    -0.47
     willingness
    -0.47
    ایا
    -0.47
    forced
    -0.47
    POSITIVE LOGITS
     accidentally
    1.24
     accident
    1.11
    accident
    1.04
     ACCIDENT
    1.01
     Accident
    0.98
     inadvertently
    0.90
    Accident
    0.90
     tripped
    0.89
     acci
    0.87
     accidents
    0.86
    Act Density 0.336%

    No Known Activations