INDEX
    Explanations

    phrases indicating a lack or absence of something

    New Auto-Interp
    Negative Logits
    ricultural
    -0.51
    DataAnnotations
    -0.50
     eduardo
    -0.50
     megane
    -0.49
     Edu
    -0.48
    Oste
    -0.47
     Alfa
    -0.47
    Alfa
    -0.47
    Stewart
    -0.46
    ǜ
    -0.46
    POSITIVE LOGITS
    None
    1.34
    none
    1.34
     none
    1.31
     None
    1.31
     NONE
    1.08
    NONE
    1.03
     ninguno
    0.93
     neither
    0.69
     nessuno
    0.66
    neither
    0.65
    Act Density 0.114%

    No Known Activations