INDEX
    Explanations

    references to individuals known for their charitable actions

    New Auto-Interp
    Negative Logits
    ertainment
    -0.15
    heit
    -0.14
    sburg
    -0.14
    ãĤ²
    -0.14
    éħ
    -0.14
    ihu
    -0.14
    edd
    -0.13
    Wal
    -0.13
    å®ħ
    -0.13
    磨
    -0.13
    POSITIVE LOGITS
     Guinness
    0.14
    ims
    0.14
    olini
    0.14
    à¤Ĥà¤ľ
    0.14
    rape
    0.14
    eroon
    0.14
    ndl
    0.14
    .compat
    0.14
    simp
    0.14
    legs
    0.14
    Act Density 0.003%

    No Known Activations