INDEX
Explanations
phrases related to questions and inquiries
Head Attribution Weights
0:0.02
1:0.03
2:0.09
3:0.21
4:0.08
5:0.02
6:0.09
7:0.06
8:0.05
9:0.04
10:0.17
11:0.09
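As a reading aid, the weights above can be ranked to see which heads carry most of the attribution. The following is a minimal sketch in Python, assuming each entry is keyed by head index; the names `head_weights` and `top_heads` are hypothetical and not part of the dashboard itself.

```python
# Head attribution weights copied from the list above (head index -> weight).
head_weights = {
    0: 0.02, 1: 0.03, 2: 0.09, 3: 0.21, 4: 0.08, 5: 0.02,
    6: 0.09, 7: 0.06, 8: 0.05, 9: 0.04, 10: 0.17, 11: 0.09,
}


def top_heads(weights, k=2):
    """Return the k (head, weight) pairs with the largest weights."""
    return sorted(weights.items(), key=lambda kv: kv[1], reverse=True)[:k]


# The listed values are rounded, so the total need not be exactly 1.
total = sum(head_weights.values())
top = top_heads(head_weights)

print(f"sum of listed weights: {total:.2f}")
for head, weight in top:
    print(f"head {head}: {weight:.2f}")
```

With the values listed here, heads 3 and 10 rank highest.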
Negative Logits
".[  -1.39
ゴン  -1.35
.[  -1.31
opol  -1.29
.}  -1.28
APD  -1.26
Ma  -1.26
forms  -1.25
mean  -1.25
."[  -1.25
Positive Logits
audi  1.52
uninstall  1.49
cheat  1.47
crochet  1.39
1.36
1.31
docker  1.30
goto  1.30
eton  1.29
grep  1.28
Activations Density 0.084%