INDEX
Explanations
references to news sources and publication dates
New Auto-Interp
Head Attr Weights
0:0.05
1:0.03
2:0.11
3:0.07
4:0.15
5:0.14
6:0.04
7:0.07
8:0.09
9:0.13
10:0.05
11:0.03
Negative Logits
glim
-2.58
enthusi
-2.29
estab
-2.22
explorers
-2.19
relic
-2.18
synonymous
-2.17
exha
-2.16
relics
-2.13
bonded
-2.12
byss
-2.12
POSITIVE LOGITS
,[
3.97
["
3.88
:[
3.83
[
3.55
Accessed
3.49
[
3.27
][
3.21
[/
3.18
:]
3.16
."[
3.07
Activations Density 0.001%