INDEX
Explanations
words or phrases related to question and answer formats
references to the letter "Q"
New Auto-Interp
Negative Logits
anish
-0.77
Dispatch
-0.70
angered
-0.65
nown
-0.65
kins
-0.64
enos
-0.63
ldom
-0.62
orious
-0.60
enshr
-0.60
abet
-0.59
POSITIVE LOGITS
Q
3.64
Q
2.54
q
2.04
Qt
1.72
QR
1.62
q
1.59
QC
1.46
qt
1.41
AQ
1.39
QL
1.37
Activations Density 0.018%