Explanations
comparisons and changes in statistics or percentages
Negative Logits
Token    Logit
lien     -0.16
́c       -0.14
足       -0.14
nger     -0.14
すぎ     -0.13
abal     -0.13
十五     -0.13
มอ       -0.13
๑        -0.13
rike     -0.13
Positive Logits
Token      Logit
just       0.18
almost     0.18
last       0.16
nearly     0.15
nil        0.14
171        0.14
allo       0.14
around     0.14
79         0.14
virtually  0.14
Activations Density: 0.081%