INDEX
Explanations
references to large-scale activities or entities
references to large-scale situations or phenomena
New Auto-Interp
Negative Logits
oother
-0.89
idges
-0.85
icut
-0.77
oras
-0.77
ĪĴ
-0.77
ston
-0.77
rium
-0.75
cair
-0.74
phis
-0.71
phe
-0.71
POSITIVE LOGITS
fabrication
0.84
enter
0.80
scale
0.77
scale
0.77
production
0.75
deployment
0.74
fulfillment
0.74
replica
0.73
deployments
0.72
administrative
0.69
Activations Density 0.020%