INDEX
Explanations
numerical data related to percentages and statistics
Head Attr Weights
0:0.06
1:0.06
2:0.05
3:0.12
4:0.04
5:0.11
6:0.06
7:0.04
8:0.02
9:0.10
10:0.10
11:0.19
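
The head attribution weights above can be ranked with a few lines of code. This is only an illustrative sketch: the values are copied from the listing, while the names `head_weights` and `rank_heads` are hypothetical and do not come from the tool that produced this page.

```python
# Head attribution weights copied from the listing above (head index -> weight).
head_weights = {
    0: 0.06, 1: 0.06, 2: 0.05, 3: 0.12, 4: 0.04, 5: 0.11,
    6: 0.06, 7: 0.04, 8: 0.02, 9: 0.10, 10: 0.10, 11: 0.19,
}


def rank_heads(weights):
    """Return (head, weight) pairs sorted from largest to smallest weight."""
    return sorted(weights.items(), key=lambda kv: kv[1], reverse=True)


ranked = rank_heads(head_weights)
top_head, top_weight = ranked[0]
total = sum(head_weights.values())

print(f"top head: {top_head} (weight {top_weight:.2f})")
print(f"sum of listed weights: {total:.2f}")
```

Head 11 carries the largest listed weight (0.19), followed by heads 3 and 5. The listed values add up to 0.95 rather than 1.00, which may simply reflect rounding to two decimals.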
Negative Logits
Token    Logit
?'       -2.29
.''      -2.17
FORM     -2.14
!'       -2.13
DISTR    -2.10
.'       -2.06
*/       -2.04
TY       -2.03
..."     -2.03
''       -2.02
Positive Logits
Token    Logit
];       2.69
];       2.53
],       2.35
leans    2.19
Palest   2.18
],       2.13
].       1.99
dylib    1.95
Kat      1.93
](       1.93
Activations Density 0.002%