Explanations
questions or statements posing a question
inquiries or questions posed within the text
Negative Logits
rites       -0.86
orpor       -0.76
ufact       -0.71
gian        -0.64
odka        -0.62
gewater     -0.62
lett        -0.61
rylic       -0.61
corrid      -0.60
ols         -0.59
Positive Logits
naires      1.73
naire       1.50
mark        1.09
posed       1.09
answer      1.02
answered    1.01
unanswered  1.00
asked       1.00
mark        0.98
arises      0.92
Activations Density: 0.044%