INDEX
Explanations
elements related to conjunctions, quantities, and references to collections or lists
Negative Logits
β        -0.24
Β        -0.23
bacteria -0.23
bishop   -0.22
biology  -0.22
β        -0.22
beta     -0.21
Bishop   -0.21
.beta    -0.21
beta     -0.20
Positive Logits
unbind   0.19
unb      0.19
worst    0.18
.unbind  0.17
Worst    0.17
wor      0.15
worse    0.15
iya      0.15
ITES     0.15
bum      0.15
Activations Density: 0.287%