INDEX
    Explanations

    words related to important or significant concepts or ideas

    important concepts or significant elements within a text

    Negative Logits
    ews         -0.71
     Tsukuyomi  -0.71
    uthor       -0.68
    asca        -0.67
    AUT         -0.67
     Bett       -0.67
     Horses     -0.66
     Mostly     -0.66
     Chatt      -0.66
     Fever      -0.65
    Positive Logits
    stone       1.19
    stroke      1.19
    stro        1.01
    stones      0.99
    */(         0.90
    binding     0.90
    wcs         0.89
    hole        0.89
    ring        0.88
     key        0.86
    Act Density 0.020%

    No Known Activations