INDEX
Explanations
technical terms and metrics related to a scientific or analytical context
New Auto-Interp
Negative Logits
GEBURTS
-1.16
Савезне
-1.15
betweenstory
-1.04
Personendaten
-1.01
IsContent
-0.98
ویکیپدی
-0.98
ſelf
-0.96
Мексичка
-0.95
neſs
-0.93
تقاوى
-0.92
POSITIVE LOGITS
↵↵
0.68
[…]
0.60
).
0.57
'
0.53
↵↵↵
0.50
…
0.49
<eos>
0.48
)).
0.46
\
0.46
))).
0.44
Activations Density 23.570%