INDEX
    Explanations

    questions and answers in a conversational format

    NEGATIVE LOGITS
    bootstrapcdn
    -0.78
     Италијани
    -0.76
    Espèce
    -0.72
     vPvB
    -0.71
     Italijanski
    -0.67
    felf
    -0.67
    ///</
    -0.63
    extAlignment
    -0.63
     تضيفلها
    -0.62
     arrang
    -0.61
    POSITIVE LOGITS
     how
    0.82
    How
    0.79
     what
    0.77
     How
    0.76
    What
    0.76
     why
    0.76
     What
    0.72
    Does
    0.67
    Why
    0.67
     Why
    0.65
    Activation Density: 0.103%

    No Known Activations