INDEX
Explanations
seemingly random characters and sequences
special characters or symbols
Negative Logits
verse            -0.68
camps            -0.67
Tra              -0.67
exhibit          -0.66
hypnot           -0.66
retrospective    -0.65
ze               -0.65
camp             -0.64
shares           -0.60
logger           -0.60
Positive Logits
ķ                4.36
ľ                1.98
Ĺ                1.95
ĸ                1.85
Ķ                1.84
µ                1.84
ĵ                1.82
¢                1.78
Ģ                1.78
Ħ                1.77
Activations Density: 0.013%