INDEX
Explanations
Attends to numeric scores in earlier tokens that describe gymnastics performances
Head Attr Weights
0:0.10
1:0.12
2:0.12
3:0.12
4:0.12
5:0.09
6:0.12
7:0.16
Negative Logits
>=",             -0.30
Hebron           -0.28
ValueGeneration  -0.28
afone            -0.28
cifix            -0.26
protectora       -0.26
esperienza       -0.26
unzel            -0.26
grun             -0.26
lycée            -0.26
Positive Logits
AnchorTagHelper   0.30
السكان            0.30
InstrumentedTest  0.29
SpringRunner      0.28
principalTable    0.27
APOLIS            0.26
ModelExpression   0.26
moder             0.25
談社              0.25
doInBackground    0.25
Activations Density 0.019%
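
A minimal sketch of how a density figure like the one above could be computed, assuming it means the fraction of token positions where the activation exceeds zero. The source does not state the exact definition or threshold, so the function name `activation_density`, the `threshold` parameter, and the sample values are all hypothetical.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of token positions where the
    activation is above the threshold (zero by default).
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 10,000 token positions, 2 of which are active.
sample = [0.0] * 9998 + [1.3, 0.7]
density = activation_density(sample)
print(f"{density:.3%}")
```

A density this low would mean the activation is nonzero on only a few positions per ten thousand tokens.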