INDEX
Explanations
architectural structures or terms
words related to architecture
New Auto-Interp
Negative Logits
lling
-0.85
liking
-0.72
ensing
-0.69
zzi
-0.68
vec
-0.68
TPS
-0.68
nder
-0.68
zzle
-0.66
cca
-0.66
instein
-0.65
POSITIVE LOGITS
itect
1.52
ief
0.98
ocobo
0.91
ipel
0.91
opoulos
0.91
osaurs
0.90
arch
0.84
stadt
0.83
bishop
0.80
sylvania
0.80
Activations Density 0.014%