INDEX
    Explanations

    phrases related to measurements or quantities

    New Auto-Interp
    Negative Logits
    loff
    -0.16
    /controllers
    -0.15
    lÃŃ
    -0.15
    λÏİ
    -0.14
    erif
    -0.14
    /backend
    -0.14
    LOSE
    -0.14
    วà¸ĩศ
    -0.14
    lein
    -0.13
    Enough
    -0.13
    POSITIVE LOGITS
     just
    0.20
     merely
    0.18
     around
    0.18
    à¥ĩà¤
    0.17
     minus
    0.16
     between
    0.16
     sorts
    0.16
       
    0.16
     slightly
    0.16
     only
    0.15
    Act Density 0.061%

    No Known Activations