    Explanations

    phrases related to discovering or uncovering information

    phrases indicating the act of discovering or learning information

    Negative Logits
     oun          -0.75
    Crystal       -0.69
    ovich         -0.69
    avorite       -0.66
    ĸļ            -0.64
    luster        -0.64
    berus         -0.62
    eries         -0.61
    uga           -0.59
    erate         -0.58
    Positive Logits
     about        1.02
     why          0.98
     how          0.97
     WHY          0.87
     whats        0.86
     beforehand   0.85
     exactly      0.85
     what         0.83
     whether      0.82
     afterwards   0.82
    Act Density 0.022%

    No Known Activations