INDEX
Explanations
the start of a new section or significant break in the text
New Auto-Interp
Negative Logits
Datuak
-1.08
ftagPool
-0.97
principalColumn
-0.96
CreateTagHelper
-0.92
مشين
-0.92
يتيمه
-0.90
tableFuture
-0.90
'\\;'
-0.89
antMatchers
-0.87
fjspx
-0.86
POSITIVE LOGITS
↵↵↵
0.70
<eos>
0.68
↵↵
0.59
↵↵↵↵
0.58
cérebro
0.57
A
0.56
вед
0.56
芙
0.55
With
0.54
↵
0.52
Activations Density 0.080%