INDEX
    Explanations

    physics/quantum physics

    New Auto-Interp
    Negative Logits
    asses
    -0.07
    	ct
    -0.07
     blocker
    -0.07
    	text
    -0.07
    του
    -0.06
     آمریکا
    -0.06
    tring
    -0.06
     Ir
    -0.06
     investigator
    -0.06
    े,
    -0.06
    POSITIVE LOGITS
     다양한
    0.08
    leftright
    0.07
     높은
    0.07
    $request
    0.07
    解决
    0.07
     sách
    0.06
     obsessed
    0.06
    健康
    0.06
    галтер
    0.06
     báo
    0.06
    Act Density 0.095%

    No Known Activations