INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    Remove
    0.90
     konusunda
    0.84
    ੁਰ
    0.84
     Veuillez
    0.84
    remove
    0.83
    )}}\
    0.80
    :=\
    0.79
    According
    0.77
     linhas
    0.77
    Answer
    0.77
    POSITIVE LOGITS
     homeostasis
    0.72
     accru
    0.72
     enne
    0.72
    0.71
     thermometer
    0.69
     everyday
    0.68
     firmly
    0.68
     philosophy
    0.67
     metabolism
    0.67
     mamm
    0.67
    Act Density 0.027%

    No Known Activations