INDEX
Explanations
proper nouns related to removal or extraction
references to political figures and actions related to removing them from power
Negative Logits
PROV          -0.62
yssey         -0.61
pitched       -0.59
success       -0.59
nosis         -0.58
Drawn         -0.57
Wit           -0.57
EngineDebug   -0.57
onward        -0.57
renaissance   -0.57
Positive Logits
altogether    1.51
entirely      1.10
from          1.05
from          0.94
redundant     0.90
veil          0.87
obsolete      0.87
pesky         0.84
FROM          0.81
inhib         0.81
Activations Density: 0.387%