INDEX
Explanations
elements related to physical structures or locations
Head Attr Weights
0:0.05
1:0.03
2:0.05
3:0.02
4:0.03
5:0.08
6:0.06
7:0.20
8:0.09
9:0.03
10:0.07
11:0.24
Negative Logits
policies: -1.40
tics: -1.33
religions: -1.33
rupted: -1.32
Administ: -1.32
soever: -1.31
doctrines: -1.31
prophets: -1.30
philosophers: -1.29
regimes: -1.27
Positive Logits
Brush: 1.48
sandwic: 1.47
ruck: 1.44
anse: 1.42
iece: 1.42
capacity: 1.40
Wilmington: 1.37
compressor: 1.35
twin: 1.34
Tub: 1.32
Activations Density 0.059%