INDEX
Explanations
mentions of specific individuals in news articles
names of political figures and notable individuals, particularly in contexts involving actions or events
New Auto-Interp
Negative Logits
ãĤ¦ãĤ¹
-0.76
izons
-0.74
inventoryQuantity
-0.71
?????-
-0.70
uries
-0.69
origin
-0.67
omatic
-0.66
olitics
-0.64
ãĤ¦
-0.62
Fi
-0.61
POSITIVE LOGITS
interacting
1.43
smiling
1.41
hugging
1.40
waving
1.37
grinning
1.36
reacting
1.35
behaving
1.34
laughing
1.33
walking
1.31
chatting
1.31
Activations Density 0.548%