INDEX
Explanations
comparisons and comparisons of quantities
comparative and superlative adjectives indicating levels of quality or quantity
New Auto-Interp
Head Attr Weights
0:0.09
1:0.01
2:0.11
3:0.06
4:0.44
5:0.05
6:0.03
7:0.01
8:0.03
9:0.05
10:0.04
11:0.02
Negative Logits
Spray
-1.41
respectively
-1.38
='
-1.32
CHAT
-1.28
lation
-1.28
Released
-1.26
ling
-1.24
XV
-1.23
Sabha
-1.23
DH
-1.18
POSITIVE LOGITS
iffe
1.75
orks
1.62
arre
1.43
ワン
1.37
skirts
1.31
bia
1.30
ued
1.28
irlf
1.26
illon
1.26
irtual
1.25
Activations Density 0.085%