INDEX
Explanations
references to historical events, entities related to warfare, and programming concepts
Negative Logits
adpleegd            -0.96
فريبيس              -0.92
NUMX                -0.91
تقاوى               -0.85
puissiez            -0.78
GenerationType      -0.77
EDEFAULT            -0.76
webElementXpaths    -0.76
.",                 -0.76
bbene               -0.75
Positive Logits
<eos>               0.95
↵↵                  0.87
:                   0.80
↵                   0.79
:                   0.64
vs                  0.58
is                  0.58
=                   0.56
↵↵↵                 0.56
                    0.54
Activations Density 2.473%