INDEX
Explanations
questions or statements with a contemplative tone
phrases that include the word "question."
Negative Logits
Token        Logit
rites        -0.84
orpor        -0.75
âĹ¼          -0.72
gewater      -0.70
gian         -0.67
agre         -0.65
corrid       -0.63
Minor        -0.63
aband        -0.62
é¾           -0.62
Positive Logits
Token        Logit
naires       1.68
naire        1.37
unanswered   1.17
posed        1.06
answered     1.04
asked        0.98
answer       0.92
mark         0.92
arises       0.91
mark         0.89
Activations Density: 0.043%