INDEX
    Explanations

    questions and inquiries regarding understanding or decision-making processes

    New Auto-Interp
    Negative Logits
    ophenyl
    -0.49
     Mc
    -0.48
     elite
    -0.47
    dom
    -0.47
     bar
    -0.46
    nia
    -0.45
    sp
    -0.45
    bin
    -0.44
    manni
    -0.44
    lata
    -0.44
    POSITIVE LOGITS
     how
    2.10
     why
    1.73
     what
    1.55
    how
    1.54
    why
    1.47
     cómo
    1.37
    what
    1.36
    How
    1.27
     How
    1.24
     HOW
    1.21
    Act Density 0.252%

    No Known Activations