INDEX
Explanations
words related to influence, both positive and negative
references to influence, particularly its significance and effects in various contexts
New Auto-Interp
Negative Logits
leigh
-0.74
eared
-0.74
ITIES
-0.73
Quotes
-0.72
asar
-0.72
DEF
-0.72
ft
-0.72
adapt
-0.72
heart
-0.71
TAG
-0.71
POSITIVE LOGITS
pedd
1.32
exerted
1.00
influencing
0.94
shaping
0.94
cooker
0.87
sway
0.87
influence
0.86
exercised
0.80
enza
0.74
orship
0.74
Activations Density 0.061%