INDEX
    Explanations

    mentions of the word "Love" with varying degrees of activation

    the word "Love" and related uses in various contexts

    New Auto-Interp
    Negative Logits
    ocument
    -0.84
     todd
    -0.82
    ulhu
    -0.79
    acco
    -0.79
    ħĭ
    -0.77
     reluct
    -0.75
    aution
    -0.74
    monary
    -0.73
    emonium
    -0.71
    NRS
    -0.69
    POSITIVE LOGITS
    lihood
    1.25
    joy
    1.10
    birds
    1.03
    bird
    0.95
     Actually
    0.89
    good
    0.86
    watching
    0.83
    tsky
    0.83
    fully
    0.83
    hound
    0.82
    Act Density 0.028%

    No Known Activations