INDEX
Explanations
tokens that represent code or programming elements, particularly in the context of data structures or APIs
New Auto-Interp
Negative Logits
ſelf
-0.92
houſe
-0.90
pleaſure
-0.86
Majefty
-0.82
myſelf
-0.81
itſelf
-0.81
BoxFit
-0.80
Eſ
-0.79
Efq
-0.78
cauſe
-0.77
POSITIVE LOGITS
Sucesor
0.54
M
0.48
bewerken
0.47
principalTable
0.47
nock
0.45
للاسماء
0.44
dica
0.44
فريبيس
0.43
makl
0.43
arca
0.42
Activations Density 1.718%