INDEX
    Explanations
    NEGATIVE LOGITS
    "_VERTICAL"     -0.09
    "コピー"        -0.09
    " copying"      -0.09
    " headings"     -0.09
    " copy"         -0.09
    " Messi"        -0.08
    " verticale"    -0.08
    "\tcopy"        -0.08
    "Copy"          -0.08
    " copia"        -0.08
    POSITIVE LOGITS
    " flush"        0.20
    " flushed"      0.19
    " Flush"        0.19
    "flush"         0.19
    "Flush"         0.18
    "_flush"        0.15
    ".flush"        0.15
    " flushing"     0.14
    ".Flush"        0.12
    " अल"           0.10
    Act Density 0.005%

    No Known Activations