INDEX
    Explanations

    the word "broken" with varying degrees of severity

    phrases that include the word "broken."

    New Auto-Interp
    Negative Logits
    minist
    -0.79
    utterstock
    -0.75
    azor
    -0.75
    yss
    -0.75
    erva
    -0.75
    metics
    -0.74
    ickr
    -0.73
    idency
    -0.73
    itatively
    -0.72
    Reviewer
    -0.72
    POSITIVE LOGITS
    neck
    0.89
     bones
    0.86
     broken
    0.85
    bones
    0.79
    broken
    0.78
     necks
    0.75
     fracture
    0.73
     Broken
    0.71
     broke
    0.70
    breaks
    0.70
    Act Density 0.016%

    No Known Activations