Explanations
requests for information or action using the phrase "Can you" followed by a verb
Negative Logits
Token     Logit
yk        -0.76
furt      -0.73
DERR      -0.73
ukemia    -0.72
senal     -0.68
stanbul   -0.67
enez      -0.66
hung      -0.66
assic     -0.65
ĸļ士      -0.65
Positive Logits
Token       Logit
imagine     1.29
afford      1.18
please      1.16
PLEASE      1.08
quantify    1.05
explain     0.92
tell        0.91
conceive    0.90
assure      0.90
summarize   0.89
Activations Density 0.043%
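
The page does not define how the density figure is computed. A minimal sketch, assuming it is the fraction of sampled token positions on which the feature has a nonzero activation: under that assumption, 0.043% corresponds to roughly 43 active positions per 100,000 tokens. The function name, the threshold parameter, and the sample data below are hypothetical and only illustrate the arithmetic.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions where the activation exceeds threshold.

    Assumption: "density" means the share of positions on which the feature fires.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 100,000 token positions, 43 of them active.
acts = [0.0] * 99_957 + [1.5] * 43
density = activation_density(acts)
print(f"{density:.3%}")  # prints 0.043%
```

A density this low is consistent with a feature that fires only on a narrow pattern, such as the "Can you" requests described in the explanation.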