INDEX
    Explanations

    statements or phrases indicating a justification or explanation for a situation or action

    statements about justification or rationale

    New Auto-Interp
    Negative Logits
    semble
    -0.72
    Interstitial
    -0.66
    chin
    -0.65
     Sour
    -0.65
    inav
    -0.65
     Carbuncle
    -0.64
     Ping
    -0.63
    oba
    -0.62
    agra
    -0.61
     puck
    -0.60
    POSITIVE LOGITS
     why
    0.94
     justifying
    0.86
    abl
    0.85
    forward
    0.77
    why
    0.76
    Origin
    0.74
    pointers
    0.73
     WHY
    0.73
     justify
    0.73
    ="#
    0.71
    Act Density 0.022%

    No Known Activations