INDEX
Explanations
commands or instructions related to programming
Negative Logits
Token      Logit
sis        -0.88
>>\        -0.74
Merit      -0.72
abet       -0.71
fur        -0.69
kaya       -0.69
si         -0.66
vic        -0.64
present    -0.64
ple        -0.64
Positive Logits
Token      Logit
them       1.14
something  0.98
him        0.96
these      0.93
some       0.93
another    0.91
somebody   0.87
someone    0.86
THEM       0.85
lots       0.84
Activations Density: 2.297%