INDEX
Explanations
references to analog technology or concepts
terms related to analog technology and analogies
New Auto-Interp
Negative Logits
apers
-0.83
hov
-0.76
words
-0.72
cloth
-0.64
aper
-0.64
eve
-0.62
Volunte
-0.60
Adamant
-0.58
Won
-0.58
perse
-0.58
POSITIVE LOGITS
ues
1.31
ously
1.14
ical
1.13
ies
0.98
izable
0.89
uers
0.88
istered
0.87
imity
0.86
izes
0.85
eties
0.85
Activations Density 0.048%