INDEX
Explanations
elements related to programming structures and code execution
Code snippets or code-related text
closing braces and punctuation
New Auto-Interp
Negative Logits
-1.32
niſſe
-1.30
ujednoznacz
-1.27
autorytatywna
-1.26
iſchen
-1.22
ſicht
-1.22
ſchaft
-1.21
iſen
-1.20
ſſung
-1.19
ésultats
-1.16
POSITIVE LOGITS
↵↵
0.70
.
0.59
1
0.54
2
0.47
0.46
↵
0.45
↵↵↵↵
0.44
-
0.44
}
0.41
0
0.39
Activations Density 0.120%