INDEX
    Negative Logits
    Token          Logit
    "nuts"         -0.83
    "nder"         -0.77
    "icer"         -0.76
    "ruciating"    -0.75
    "vae"          -0.75
    "ntil"         -0.71
    "idth"         -0.67
    "aneous"       -0.67
    "ente"         -0.66
    "ership"       -0.66
    Positive Logits
    Token          Logit
    "ographer"     0.88
    " map"         0.87
    " mapped"      0.78
    " sheet"       0.78
    " maps"        0.77
    " outline"     0.76
    " overlay"     0.74
    "map"          0.73
    "MAP"          0.72
    " MAP"         0.72
    Activation Density: 0.011%

    No Known Activations