INDEX
Explanations
phrases containing non-English characters and a mix of English words
specific characters or symbols that may indicate encoded or corrupted text
New Auto-Interp
Negative Logits
boys
-0.63
accompan
-0.61
bearer
-0.60
packing
-0.59
multipl
-0.59
theless
-0.58
overload
-0.58
DragonMagazine
-0.55
besides
-0.55
watching
-0.54
POSITIVE LOGITS
´
1.41
¤
1.41
¶
1.39
²
1.36
¾
1.34
¬
1.30
¦
1.29
ĥ
1.27
¼
1.27
«
1.26
Activations Density 0.074%