INDEX
    Explanations

    phrases related to problem-solving or decision-making

    occurrences of the word "figure" and its variations

    New Auto-Interp
    Negative Logits
    ibaba
    -0.84
    00000
    -0.74
    rem
    -0.66
    nor
    -0.64
    rons
    -0.63
    ewitness
    -0.63
    VIDEO
    -0.63
    rary
    -0.61
    ription
    -0.60
    riott
    -0.60
    POSITIVE LOGITS
     prominently
    0.84
     out
    0.80
     skating
    0.80
    istically
    0.73
    omething
    0.66
    matically
    0.65
    heads
    0.64
    ħ
    0.62
    sonian
    0.62
     how
    0.60
    Act Density 0.033%

    No Known Activations