INDEX
    Explanations

    questions related to actions or decisions

    New Auto-Interp
    Negative Logits
    onz
    -0.69
    tun
    -0.69
    iza
    -0.66
    gem
    -0.66
    WAYS
    -0.66
    bis
    -0.64
    legram
    -0.64
    ldom
    -0.62
    bow
    -0.62
    charg
    -0.61
    POSITIVE LOGITS
     happens
    0.99
     happen
    0.96
     distinguishes
    0.90
     happened
    0.89
     transpired
    0.89
     awaits
    0.84
    ?),
    0.82
    ?",
    0.82
    ?
    0.80
     constitutes
    0.77
    Act Density 0.055%

    No Known Activations