INDEX
Explanations
numerical values related to statistics and percentages
New Auto-Interp
Head Attr Weights
0:0.03
1:0.02
2:0.12
3:0.06
4:0.08
5:0.03
6:0.36
7:0.06
8:0.03
9:0.05
10:0.06
11:0.05
Negative Logits
gdala
-1.39
rightfully
-1.37
iscons
-1.31
cients
-1.30
ocument
-1.25
Leilan
-1.22
icians
-1.21
ateurs
-1.17
LCS
-1.14
inaction
-1.14
POSITIVE LOGITS
rc
1.50
nova
1.46
ogan
1.38
iev
1.34
bsp
1.30
notice
1.29
м
1.29
ilion
1.28
meier
1.28
sie
1.28
Activations Density 0.060%