INDEX
Explanations
keywords related to computer commands and prompts
references to command-line operations and prompts
New Auto-Interp
Negative Logits
Ĥª
-0.80
Ukrain
-0.74
abouts
-0.65
Beir
-0.64
verning
-0.64
Econom
-0.63
اÙĦ
-0.63
Sect
-0.61
Torn
-0.61
akening
-0.60
POSITIVE LOGITS
prompt
1.26
line
1.11
line
1.10
Prompt
0.99
Line
0.97
substitution
0.93
injection
0.92
invocation
0.89
syntax
0.88
LINE
0.87
Activations Density 0.049%