INDEX
Explanations
words related to influential figures or entities
references to influential individuals or entities
Negative Logits
Token    Logit
gged     -0.76
thia     -0.76
otide    -0.75
ft       -0.74
È        -0.74
ago      -0.73
á        -0.73
few      -0.72
AUT      -0.71
thus     -0.71
Positive Logits
Token           Logit
influential     0.96
endorsements    0.86
influ           0.85
personalities   0.83
influence       0.81
clout           0.80
figures         0.73
voices          0.73
ゼウス          0.71
enough          0.70
Activations Density: 0.013%