INDEX
Negative Logits
().↵
-0.20
.↵
-0.19
%.↵
-0.17
。↵
-0.17
:↵
-0.16
():↵
-0.16
."↵
-0.15
."↵↵
-0.15
(),↵
-0.14
()));↵↵
-0.14
POSITIVE LOGITS
|
0.54
|↵↵
0.45
|↵
0.44
| ↵
0.43
|↵//
0.38
|\
0.38
)|
0.35
.|
0.35
]|
0.35
|
0.34
Activations Density 0.933%