INDEX
Explanations
patterns related to mathematical expressions and operations
Negative Logits
Token            Logit
']")             -1.05
"],              -1.04
"):              -1.03
autorytatywna    -1.02
"]);             -1.01
}")              -1.00
)"),             -0.99
</caption>       -0.99
"])              -0.99
"}},             -0.98
Positive Logits
Token            Logit
1                1.99
2                1.13
0                1.11
3                1.05
5                0.98
6                0.92
4                0.91
9                0.89
7                0.84
8                0.79
Activations Density: 1.596%