INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     vanity
    -0.08
     Erotik
    -0.08
     personal
    -0.07
     Personal
    -0.07
    .Proxy
    -0.07
     Benz
    -0.07
     dubbel
    -0.07
     Herman
    -0.07
    ekin
    -0.07
     Token
    -0.07
    POSITIVE LOGITS
     column
    0.11
     columna
    0.11
    	column
    0.10
    (column
    0.10
    column
    0.10
     coluna
    0.10
    Column
    0.09
     columns
    0.09
     columnas
    0.09
    columns
    0.09
    Act Density 0.013%

    No Known Activations