INDEX
Explanations
locations or establishments
phrases related to historical events, political figures, and significant happenings
prepositional phrases indicating relationships or dependencies
New Auto-Interp
Head Attr Weights
0:0.03
1:0.02
2:0.14
3:0.04
4:0.32
5:0.09
6:0.02
7:0.02
8:0.07
9:0.13
10:0.04
11:0.02
Negative Logits
jay
-1.65
itz
-1.61
voy
-1.55
brew
-1.54
ews
-1.54
blade
-1.52
ipolar
-1.52
cussion
-1.50
uter
-1.45
aunt
-1.43
POSITIVE LOGITS
principles
1.51
UA
1.50
fundamentals
1.50
tenets
1.47
foundations
1.43
Satoshi
1.36
premise
1.35
foundation
1.35
Revival
1.35
dogma
1.32
Activations Density 0.006%