INDEX
    Explanations

    phrases related to challenges and improvements

    New Auto-Interp
    Negative Logits
     Less
    -0.17
    Less
    -0.15
    ÅĽÄĩ
    -0.15
    rosso
    -0.14
     moins
    -0.14
     Lesser
    -0.14
    -less
    -0.13
     olsun
    -0.13
    ë¥
    -0.13
    least
    -0.13
    POSITIVE LOGITS
     even
    0.81
     further
    0.68
    even
    0.68
     EVEN
    0.66
     still
    0.61
     yet
    0.59
     Even
    0.56
    Even
    0.56
     jeszcze
    0.55
     еÑīе
    0.54
    Act Density 0.252%

    No Known Activations