INDEX
Explanations
references to warfare and conflict-related themes
Mathematical or symbolic notation
code comments and transformations
New Auto-Interp
Negative Logits
+#+#
-0.71
WriteLiteral
-0.58
Fick
-0.57
sanitaires
-0.55
Bands
-0.52
Bens
-0.50
météo
-0.49
Permit
-0.49
-0.49
Kontrola
-0.49
POSITIVE LOGITS
Transformers
0.85
TRANSFORM
0.83
Optimus
0.82
Auto
0.80
gatron
0.77
Transformers
0.76
transform
0.74
transform
0.74
TRANSFORM
0.74
Transformer
0.72
Activations Density 0.180%