INDEX
Explanations
words associated with tall buildings and their architectural features
New Auto-Interp
Head Attr Weights
0:0.02
1:0.02
2:0.05
3:0.05
4:0.04
5:0.04
6:0.03
7:0.05
8:0.03
9:0.04
10:0.40
11:0.19
Negative Logits
learn
-1.37
RESULTS
-1.28
-1.24
METHOD
-1.23
Runes
-1.23
TPP
-1.20
Jar
-1.16
pora
-1.15
twitch
-1.15
rahim
-1.14
POSITIVE LOGITS
occupies
1.41
..........
1.41
occupant
1.35
displ
1.31
owered
1.31
Construction
1.28
canceled
1.24
adjacent
1.22
floats
1.21
collapsing
1.20
Activations Density 0.222%