INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    nuts
    -0.85
    icer
    -0.85
    ruciating
    -0.78
    aneous
    -0.76
    nder
    -0.75
    ntil
    -0.74
    weeney
    -0.74
    quished
    -0.72
    ente
    -0.71
    usters
    -0.71
    POSITIVE LOGITS
    ographer
    0.83
     map
    0.83
    map
    0.77
     maps
    0.73
    makers
    0.70
     Map
    0.70
    maker
    0.70
     tiles
    0.70
    hack
    0.70
    ãĤº
    0.70
    Act Density 0.012%

    No Known Activations