INDEX
Explanations
words related to components or types of valves and similar mechanisms
New Auto-Interp
Negative Logits
net
-0.17
ner
-0.17
ctor
-0.17
misc
-0.16
nero
-0.16
nt
-0.16
tron
-0.15
ark
-0.15
emon
-0.14
nic
-0.14
POSITIVE LOGITS
ighbour
0.21
utral
0.21
uve
0.20
ighb
0.20
ymoon
0.19
ering
0.19
jad
0.19
ymous
0.18
ighbours
0.18
berger
0.18
Activations Density 0.048%