INDEX
Explanations
function definitions and control structures, particularly in programming code
New Auto-Interp
Negative Logits
ochem
-0.54
nó
-0.45
isma
-0.43
an
-0.42
atro
-0.41
esos
-0.41
zkiem
-0.40
рованные
-0.39
trop
-0.39
cionar
-0.39
POSITIVE LOGITS
self
3.22
self
3.15
Self
2.42
Self
2.31
SELF
2.16
selves
1.96
SELF
1.85
Selbst
1.84
herself
1.74
zelf
1.69
Activations Density 0.054%