INDEX
Explanations
conditional statements, especially regarding outcomes or effects
New Auto-Interp
Head Attr Weights
0:0.02
1:0.05
2:0.09
3:0.05
4:0.02
5:0.07
6:0.19
7:0.06
8:0.14
9:0.13
10:0.09
11:0.04
Negative Logits
picks
-1.07
boards
-1.04
chairs
-1.02
pires
-1.01
watches
-1.00
recorder
-0.98
board
-0.97
Norn
-0.97
hunts
-0.96
ascus
-0.95
POSITIVE LOGITS
ieu
1.31
etheless
1.18
ample
1.11
adversely
1.11
emi
1.10
oldown
1.09
suffice
1.05
terness
1.05
opian
1.04
rue
1.03
Activations Density 0.309%