INDEX
Explanations
references to agents and agency in a narrative context
New Auto-Interp
Negative Logits
Krie
-0.74
Lili
-0.73
SWR
-0.72
battleship
-0.71
Lasso
-0.71
prehension
-0.71
bifur
-0.70
ths
-0.70
Leck
-0.69
bifurcation
-0.69
POSITIVE LOGITS
agents
2.30
Agents
2.23
agent
2.20
Agent
2.14
Agents
2.10
Agent
2.07
AGENT
2.03
agents
1.99
AGENTS
1.97
agent
1.94
Activations Density 0.064%