INDEX
    Explanations

    mentions of specific individuals in news articles

    names of political figures and notable individuals, particularly in contexts involving actions or events

    Negative Logits
    Token                Logit
    ãĤ¦ãĤ¹               -0.76
    izons                -0.74
    inventoryQuantity    -0.71
    ?????-               -0.70
    uries                -0.69
    origin               -0.67
    omatic               -0.66
    olitics              -0.64
    ãĤ¦                  -0.62
    Fi                   -0.61
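
Token strings such as ãĤ¦ãĤ¹ in the list above look like the printable stand-ins that GPT-2 style byte-level BPE vocabularies use for raw bytes. The source does not say which tokenizer produced them, so the following is only a minimal sketch under that assumption; the helper names `bytes_to_unicode` and `decode_token` are illustrative, not part of any dashboard API. It rebuilds the byte-to-character table and maps a token string back to readable text.

```python
def bytes_to_unicode():
    """Byte -> printable character table, as in GPT-2 style byte-level BPE.

    Assumption: the tokens in this dashboard come from such a vocabulary.
    """
    # Bytes that are already printable keep their own code point.
    printable = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(ord("¡"), ord("¬") + 1))
        + list(range(ord("®"), ord("ÿ") + 1))
    )
    chars = printable[:]
    offset = 0
    # Every remaining byte is shifted to an unused code point at 256 and above.
    for b in range(256):
        if b not in printable:
            printable.append(b)
            chars.append(256 + offset)
            offset += 1
    return {b: chr(c) for b, c in zip(printable, chars)}


# Inverse table: stand-in character -> original byte value.
CHAR_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def decode_token(token: str) -> str:
    """Turn a vocabulary token string back into readable text."""
    raw = bytearray()
    for ch in token:
        if ch in CHAR_TO_BYTE:
            raw.append(CHAR_TO_BYTE[ch])
        else:
            # Characters outside the table (e.g. a literal space) pass through.
            raw.extend(ch.encode("utf-8"))
    # Partial multi-byte sequences are shown as the replacement character.
    return raw.decode("utf-8", errors="replace")


print(decode_token("ãĤ¦ãĤ¹"))  # ウス
print(decode_token("ãĤ¦"))       # ウ
```

Under this assumption the two garbled-looking entries are katakana fragments, while plain ASCII tokens such as `olitics` decode to themselves.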
    Positive Logits
    Token                Logit
     interacting         1.43
     smiling             1.41
     hugging             1.40
     waving              1.37
     grinning            1.36
     reacting            1.35
     behaving            1.34
     laughing            1.33
     walking             1.31
     chatting            1.31
    Activation Density: 0.548%

    No Known Activations