INDEX
Explanations
mentions of influential figures/entities
references to influential individuals or entities
New Auto-Interp
Negative Logits
thia
-0.79
few
-0.76
©¶æ
-0.73
ft
-0.70
fter
-0.70
aked
-0.69
otide
-0.69
È
-0.69
VK
-0.69
gged
-0.69
POSITIVE LOGITS
influential
0.94
influ
0.90
endorsements
0.90
figures
0.82
personalities
0.81
influence
0.81
role
0.79
gossip
0.78
endors
0.74
pund
0.73
Activations Density 0.014%