INDEX
Explanations
inquiries and questions raised in various contexts
New Auto-Interp
Head Attr Weights
0:0.45
1:0.02
2:0.14
3:0.09
4:0.02
5:0.05
6:0.02
7:0.03
8:0.02
9:0.02
10:0.09
11:0.01
Negative Logits
Sync
-2.87
Enhanced
-2.55
Pace
-2.31
Cord
-2.31
闘
-2.30
Morrow
-2.29
Backup
-2.27
backups
-2.27
oother
-2.26
vision
-2.23
POSITIVE LOGITS
answ
4.83
answer
4.77
answered
4.72
answers
4.68
Answer
4.48
unanswered
4.45
answered
4.38
answering
4.37
swers
4.34
questions
4.32
Activations Density 0.190%