INDEX
    Explanations

    quoted speech or statements

    Negative Logits
    Token          Logit
    "omer"         -0.06
    " Eph"         -0.06
    "omers"        -0.06
    "anter"        -0.06
    "ãĥ¼ãĥ«ãĥī"    -0.06
    " Bol"         -0.06
    "oth"          -0.05
    " Hung"        -0.05
    "rell"         -0.05
    " LA"          -0.05
    Positive Logits
    Token          Logit
    "ãĥ«ãĤ¯"       0.07
    "bsite"        0.07
    "lick"         0.07
    " çŁ³"         0.07
    " Klo"         0.07
    " hrom"        0.07
    "eks"          0.07
    "okud"         0.06
    "ì¹"           0.06
    " ç±"          0.06
    Act Density 0.017%

    No Known Activations