INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    يانة
    -0.07
     hesitate
    -0.07
    >s
    -0.07
     remember
    -0.07
     POP
    -0.07
     capability
    -0.07
     Carl
    -0.06
     Karn
    -0.06
     Dresden
    -0.06
    にして
    -0.06
    POSITIVE LOGITS
     tax
    0.12
     Tax
    0.11
     taxes
    0.09
    Tax
    0.08
     TAX
    0.07
     Taxes
    0.07
    camatan
    0.07
    -tax
    0.07
    TextLabel
    0.07
    xc
    0.07
    Act Density 0.017%

    No Known Activations