INDEX
    Explanations
    Negative Logits
    Token (quoted)    Logit
    ' ")'             -0.08
    '_AGENT'          -0.07
    'يلم'             -0.07
    ' scorn'          -0.07
    ' ngại'           -0.07
    'lingen'          -0.06
    'WebDriver'       -0.06
    'رى'              -0.06
    'هدف'             -0.06
    'ayd'             -0.06
    Positive Logits
    Token (quoted)    Logit
    ' cannabis'       0.16
    ' Cannabis'       0.13
    'abis'            0.08
    ' marijuana'      0.07
    ' cannabin'       0.06
    ' cbd'            0.06
    '<Scalars'        0.06
    ' Kumar'          0.06
    '.cls'            0.06
    '_FULL'           0.06
    Activation Density: 0.002%

    No Known Activations