Explanations
questions and their corresponding responses or answers
Negative Logits
それでも          -0.65
therefore         -0.58
writeFieldEnd     -0.51
nonetheless       -0.49
Therefore         -0.49
therefore         -0.48
asimismo          -0.48
inoltre           -0.47
TemporalType      -0.47
nevertheless      -0.47
Positive Logits
Actually          1.16
Nope              1.10
Nope              1.10
Actually          1.09
actually          1.06
actually          1.05
nope              0.99
Depends           0.93
Absolutely        0.91
nope              0.90
Activations Density: 0.429%