INDEX
Explanations
phrases indicating causation or potential consequences
occurrences of the word "lead" in various contexts
New Auto-Interp
Negative Logits
Mach
-0.76
apy
-0.73
eatures
-0.66
ongyang
-0.65
emis
-0.63
arma
-0.63
cube
-0.63
phis
-0.63
orrow
-0.62
ategor
-0.62
POSITIVE LOGITS
lead
1.05
lead
0.90
better
0.88
Lead
0.85
Leading
0.81
Lead
0.81
leads
0.75
leading
0.71
boards
0.70
ership
0.70
Activations Density 0.019%