INDEX
    Explanations

    mentions of economic concepts and their impacts on society

    Negative Logits
    Logit   Token
    -0.20   "阅读次数" (read count)
    -0.16   "下载次数" (download count)
    -0.16   "监听页面" (listening page)
    -0.15   "中文字幕" (Chinese subtitles)
    -0.14   "„N"
    -0.14   "оби"
    -0.14   " 投稿日" (submission date)
    -0.14   " (!["
    -0.14   "„J"
    -0.14   "OrNil"
    Positive Logits
    Logit   Token
    0.22    " ,"
    0.21    " ;↵"
    0.21    " ;"
    0.21    "("
    0.19    " which"
    0.19    " ,↵"
    0.19    " :↵"
    0.18    " :"
    0.17    (non-breaking space, U+00A0)
    0.17    " Which"
    Activation Density: 2.580%

    No Known Activations
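
Dashboards of this kind often export tokens in the byte-level BPE alphabet, where a string such as `ä¸ŃæĸĩåŃĹå¹ķ` stands for the UTF-8 bytes of 中文字幕. The sketch below shows one way to reverse that mapping. It assumes the tokens follow the GPT-2 byte-to-unicode convention (the page does not name the tokenizer), and the helper names `bytes_to_unicode` and `decode_token` are illustrative, not part of any dashboard API.

```python
def bytes_to_unicode() -> dict[int, str]:
    """Rebuild the GPT-2-style table mapping each byte to a printable character."""
    # Bytes that are already printable keep their own code point.
    bs = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(0xA1, 0xAD))
        + list(range(0xAE, 0x100))
    )
    cs = bs[:]
    n = 0
    # Every remaining byte is shifted to code point 256 + n.
    for b in range(256):
        if b not in bs:
            bs.append(b)
            cs.append(256 + n)
            n += 1
    return dict(zip(bs, map(chr, cs)))


CHAR_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def decode_token(token: str) -> str:
    """Turn a byte-level token string back into readable text."""
    raw = bytearray()
    for ch in token:
        if ch in CHAR_TO_BYTE:
            raw.append(CHAR_TO_BYTE[ch])
        else:
            # Assumption: characters outside the table were already decoded
            # upstream (or are literal spaces), so they pass through as UTF-8.
            raw.extend(ch.encode("utf-8"))
    return raw.decode("utf-8", errors="replace")


if __name__ == "__main__":
    for tok in ["éĺħ读次æķ°", "ä¸ŃæĸĩåŃĹå¹ķ", "âĢŀN", "оби"]:
        print(repr(tok), "->", repr(decode_token(tok)))
```

The pass-through branch is a pragmatic choice for partially repaired dumps. It would misread text that legitimately contains Latin-1 characters such as `é`, so it is best applied only to strings known to be raw token renderings.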