INDEX
Explanations
non-zero activation values that indicate significant content or entry points in the text
Code, programming, and scripting related tokens
code keywords and separators
New Auto-Interp
Negative Logits
++
-0.59
()
-0.46
_
-0.44
irchen
-0.44
%
-0.41
".
-0.41
[]
-0.40
AKER
-0.39
$.
-0.39
Commencez
-0.39
POSITIVE LOGITS
class
0.69
import
0.68
def
0.67
0.65
import
0.65
function
0.64
class
0.61
MigrationBuilder
0.60
public
0.60
def
0.60
Activations Density 0.438%