INDEX
Explanations
mentions of the material "steel" in various contexts
references to the steel industry
New Auto-Interp
Negative Logits
sembly
-0.82
uate
-0.79
romeda
-0.75
Kard
-0.73
Niet
-0.72
DOE
-0.70
olulu
-0.70
etheless
-0.67
Lori
-0.64
itia
-0.63
POSITIVE LOGITS
Series
1.15
works
1.09
wool
1.04
workers
1.04
worker
0.93
steel
0.88
anguage
0.86
fish
0.85
heart
0.84
beams
0.83
Activations Density 0.017%