INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    	day
    -0.07
     판매
    -0.07
     Byte
    -0.07
     srand
    -0.06
     Soft
    -0.06
     verir
    -0.06
     Luz
    -0.06
     compat
    -0.06
     chic
    -0.06
    .Merge
    -0.06
    POSITIVE LOGITS
     tells
    0.16
     tell
    0.12
     Tells
    0.11
     telling
    0.10
     told
    0.08
     Tell
    0.08
    Tell
    0.07
    0.07
     Tommy
    0.06
     Wolff
    0.06
    Act Density 0.025%

    No Known Activations