INDEX
Explanations
punctuation and numerical patterns
New Auto-Interp
Negative Logits
DockStyle
-0.59
total
-0.46
TagHelpers
-0.46
<strong>
-0.44
FRE
-0.42
c
-0.42
WARRANTIES
-0.41
(
-0.41
//
-0.41
After
-0.40
POSITIVE LOGITS
pleaſure
1.03
purpoſe
1.01
myſelf
1.00
itſelf
0.91
Jefus
0.89
Theſe
0.89
houſe
0.87
Diſ
0.87
Monfieur
0.85
uſe
0.84
Activations Density 0.467%