INDEX
Explanations
phrases that contain the word "of" and focus on quantities or relationships involving "couple"
New Auto-Interp
Head Attr Weights
0:0.02
1:0.07
2:0.11
3:0.04
4:0.05
5:0.04
6:0.28
7:0.05
8:0.02
9:0.04
10:0.06
11:0.17
Negative Logits
sonian
-2.13
aucas
-1.60
berus
-1.34
Chair
-1.33
IELD
-1.33
��
-1.24
andra
-1.24
berman
-1.24
agonal
-1.22
FORE
-1.22
POSITIVE LOGITS
dozen
1.29
Expand
1.28
ttes
1.25
ript
1.20
��極
1.19
phrase
1.18
sentences
1.18
yard
1.17
acre
1.16
dozen
1.16
Activations Density 0.007%