INDEX
Explanations
references to recovery or improvement in performance or status
Head Attr Weights
0: 0.07
1: 0.04
2: 0.05
3: 0.07
4: 0.04
5: 0.06
6: 0.04
7: 0.03
8: 0.38
9: 0.08
10: 0.06
11: 0.03
Negative Logits
ignt          -1.48
interstitial  -1.46
dn            -1.42
pread         -1.39
missions      -1.39
humans        -1.39
orrow         -1.38
usting        -1.37
taboola       -1.37
Requ          -1.35
Positive Logits
rug       1.47
Rez       1.46
DISTRICT  1.34
playbook  1.33
stride    1.32
precon    1.26
preset    1.24
melon     1.23
melody    1.23
blond     1.22
Activations Density 0.037%