INDEX
    Explanations

    Soccer/Football

    New Auto-Interp
    Negative Logits
     mmc
    -0.07
    lerle
    -0.06
     rift
    -0.06
    larla
    -0.06
    ่ง
    -0.06
    -elements
    -0.06
    分类
    -0.06
     confined
    -0.06
     засобів
    -0.06
     تلك
    -0.06
    POSITIVE LOGITS
     cree
    0.07
     hor
    0.06
     parfait
    0.06
     sue
    0.06
     VH
    0.06
    0.06
     hacks
    0.06
    "];↵↵
    0.06
    OLVE
    0.06
    	describe
    0.06
    Act Density 0.004%

    No Known Activations