INDEX
    Explanations

    Past tense verbs

    Negative Logits
    " Sale"      -0.07
    "Tube"       -0.07
    "кість"      -0.07
    "Search"     -0.07
    " make"      -0.06
    "ODEV"       -0.06
    " cell"      -0.06
    " wires"     -0.06
    " pauses"    -0.06
    "\tInput"    -0.06
    Positive Logits
    " فت"        0.06
    " kov"       0.06
    " 베스트"     0.06
    "atham"      0.06
    " jav"       0.06
    " ferv"      0.06
    " IRQ"       0.06
    " zároveň"   0.06
    "educ"       0.06
    0.05
    Activation Density: 0.025%

    No Known Activations