INDEX
Explanations
dates and timestamps in posts
Head Attr Weights
0:0.16
1:0.01
2:0.02
3:0.08
4:0.02
5:0.07
6:0.02
7:0.28
8:0.02
9:0.04
10:0.09
11:0.12
Negative Logits
surv          -2.82
ItemTracker   -2.27
compan        -2.15
efficients    -2.12
comr          -2.12
��極           -2.05
arettes       -2.04
=~            -2.02
paed          -2.01
Bet           -2.01
Positive Logits
Cancel        2.48
xit           2.46
ategor        2.25
(@            2.16
partName      2.13
ctive         2.08
Nanto         1.94
audi          1.91
idel          1.90
guiName       1.88
Activations Density 0.006%