Explanations
phrases related to progress or advancement
phrases or references to a prolonged duration or significant progress
Negative Logits
Token        Logit
agents       -0.84
iances       -0.78
selves       -0.76
packages     -0.76
redients     -0.75
CB           -0.74
illions      -0.72
icks         -0.71
osponsors    -0.71
leck         -0.70
Positive Logits
Token        Logit
sword        1.08
itud         1.05
overdue      1.01
term         0.96
ago          0.95
itudinal     0.94
lasting      0.94
slee         0.88
enough       0.86
swath        0.83
Activations Density: 0.039%