INDEX
Explanations
numeric patterns in HTML-like content
punctuation marks and symbols in text
New Auto-Interp
Negative Logits
mathemat
-1.01
tremend
-0.92
fortun
-0.80
nodd
-0.77
confir
-0.76
paralyzed
-0.75
manif
-0.75
sophistic
-0.74
bounded
-0.73
unnecess
-0.70
POSITIVE LOGITS
</
0.86
&
0.85
gt
0.84
display
0.81
amp
0.80
lt
0.78
ohm
0.78
Tang
0.77
[/
0.76
\">
0.76
Activations Density 0.020%