INDEX
Explanations
references to metal and metallic materials
New Auto-Interp
Negative Logits
es
-0.18
ester
-0.18
edar
-0.17
estro
-0.17
ety
-0.17
ez
-0.16
esi
-0.16
ÏĨή
-0.15
eyim
-0.15
ÙĨ
-0.15
POSITIVE LOGITS
licity
0.39
lica
0.36
lic
0.32
anguage
0.30
lico
0.27
working
0.26
urgical
0.26
urgy
0.25
workers
0.24
lo
0.24
Activations Density 0.012%