INDEX
Explanations
references to reliance or dependency on others or things
New Auto-Interp
Head Attr Weights
0:0.04
1:0.01
2:0.07
3:0.10
4:0.20
5:0.03
6:0.28
7:0.06
8:0.04
9:0.03
10:0.04
11:0.04
Negative Logits
inav
-1.42
Samoa
-1.34
nexus
-1.26
ゴン
-1.26
Weather
-1.25
modernization
-1.21
drug
-1.20
osc
-1.19
antics
-1.19
scheduling
-1.17
POSITIVE LOGITS
hered
1.45
iably
1.38
oku
1.35
icably
1.33
Schwar
1.31
ourced
1.30
ered
1.27
altogether
1.24
plin
1.22
ware
1.20
Activations Density 0.009%