INDEX
Explanations
quotations or dialogue within the text
New Auto-Interp
Negative Logits
ikit
-0.16
ñana
-0.16
hrom
-0.15
localctx
-0.15
اصÙĦÙĩ
-0.15
ago
-0.15
icha
-0.14
gili
-0.14
emode
-0.14
ابÙĬ
-0.14
POSITIVE LOGITS
forces
0.16
ido
0.16
od
0.16
ippers
0.15
igg
0.15
odds
0.14
exact
0.14
Zw
0.14
favor
0.14
ton
0.13
Activations Density 0.048%