INDEX
Explanations
references to making things easier or more manageable
New Auto-Interp
Head Attr Weights
0:0.03
1:0.01
2:0.05
3:0.07
4:0.13
5:0.01
6:0.04
7:0.42
8:0.03
9:0.02
10:0.05
11:0.08
Negative Logits
vernment
-1.78
acons
-1.64
DonaldTrump
-1.51
aceutical
-1.50
cffff
-1.46
orah
-1.45
restricted
-1.45
redients
-1.42
iosyncr
-1.41
isance
-1.39
POSITIVE LOGITS
transition
1.84
transitions
1.66
breeze
1.65
smoother
1.62
passage
1.62
blended
1.57
smoot
1.54
groove
1.49
blending
1.40
easing
1.39
Activations Density 0.002%