INDEX
    Explanations

    instances where "smile" is mentioned in different contexts

    references to smiling or expressions of happiness

    New Auto-Interp
    Negative Logits
    æ©Ł
    -0.83
    ferred
    -0.69
    aer
    -0.67
     Administ
    -0.66
    unes
    -0.65
    lay
    -0.65
    inventoryQuantity
    -0.64
    FER
    -0.63
    raped
    -0.62
    forums
    -0.61
    POSITIVE LOGITS
     smile
    1.03
     smiles
    0.93
     hello
    0.89
    creen
    0.89
     grin
    0.89
     goodbye
    0.87
     smiling
    0.83
    heet
    0.77
    eful
    0.74
     emot
    0.74
    Act Density 0.010%

    No Known Activations