INDEX
Explanations
sentences that express significant events or moments
Sentences before enumeration
mathematical notation
New Auto-Interp
Negative Logits
läßt
-0.92
muß
-0.90
idéia
-0.77
daß
-0.77
http
-0.76
!!!!!
-0.76
!!!
-0.74
!!!!!
-0.72
!!!!
-0.72
!!!!
-0.71
POSITIVE LOGITS
Alongside
0.97
Alongside
0.93
“[
0.91
Luckily
0.86
Ultimately
0.85
enquanto
0.85
Notably
0.84
unsur
0.84
prioritizing
0.83
Ultimately
0.83
Activations Density 0.142%