INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    159
    -0.08
     rotary
    -0.08
     methane
    -0.07
     Mega
    -0.07
     Prelude
    -0.07
     bok
    -0.07
    pek
    -0.07
    stok
    -0.07
    -0.07
     bic
    -0.07
    POSITIVE LOGITS
     pav
    0.08
     sekitar
    0.08
    arians
    0.08
     suns
    0.08
    0.07
    continent
    0.07
    领先
    0.07
     Columbia
    0.07
    mate
    0.07
     ij
    0.07
    Act Density 0.012%

    No Known Activations