INDEX
    Explanations
    NEGATIVE LOGITS
    Token             Logit
    " UFOs"           -0.75
    "76561"           -0.69
    " inaccur"        -0.66
    " 777"            -0.66
    " vanished"       -0.66
    " disappeared"    -0.65
    "Downloadha"      -0.64
    " Hilton"         -0.64
    " disappears"     -0.63
    " UFO"            -0.63
    POSITIVE LOGITS
    Token             Logit
    "utils"            1.09
    "framework"        0.90
    " import"          0.87
    " namespace"       0.84
    "lib"              0.84
    "tools"            0.81
    "util"             0.81
    "common"           0.80
    "parser"           0.80
    "plugin"           0.80
    Act Density 0.049%

    No Known Activations