INDEX
Explanations
phrases related to financial transactions and costs
New Auto-Interp
Head Attr Weights
0:0.02
1:0.01
2:0.05
3:0.03
4:0.15
5:0.03
6:0.07
7:0.39
8:0.02
9:0.03
10:0.09
11:0.05
Negative Logits
Appearance
-1.92
icrobial
-1.91
ailability
-1.81
SPONSORED
-1.80
iatric
-1.71
Temperature
-1.65
ocal
-1.65
PLIED
-1.64
Reviewer
-1.64
itness
-1.63
POSITIVE LOGITS
rewrite
1.84
reins
1.73
LW
1.72
Merge
1.69
undone
1.60
upstream
1.58
uranium
1.54
throttle
1.43
©
1.42
manuscript
1.41
Activations Density 0.002%