INDEX
    NEGATIVE LOGITS
    " Drum"         -0.08
    " hemorrh"      -0.07
    " bare"         -0.07
    " fire"         -0.07
    " dry"          -0.07
    " birth"        -0.07
    " thus"         -0.07
    "153"           -0.07
    " Birth"        -0.07
    " decreased"    -0.07
    POSITIVE LOGITS
    " option"       0.17
    " options"      0.14
    "Option"        0.13
    " Option"       0.13
    " Options"      0.12
    "options"       0.11
    "_option"       0.11
    " opción"       0.10
    "option"        0.10
    "-option"       0.10
    Act Density 0.042%

    No Known Activations