INDEX
Explanations
variables and mathematical expressions within equations
New Auto-Interp
Negative Logits
isan
-0.16
lobal
-0.15
eric
-0.14
illos
-0.14
TEL
-0.14
.gb
-0.14
nty
-0.13
edia
-0.13
egin
-0.13
egt
-0.13
POSITIVE LOGITS
ld
0.41
ld
0.36
cd
0.33
dots
0.33
dots
0.29
LD
0.26
hd
0.25
ots
0.24
cd
0.23
LD
0.22
Activations Density 0.047%