INDEX
Explanations
questions or requests for information directed towards a person or entity
New Auto-Interp
Negative Logits
Ĥ¬
-0.78
Scouting
-0.67
âĶĢâĶĢ
-0.66
EStreamFrame
-0.66
luaj
-0.66
cutting
-0.64
lim
-0.64
Lago
-0.62
absor
-0.61
enactment
-0.61
POSITIVE LOGITS
questions
1.29
rhet
1.27
naires
1.14
answered
1.09
probing
1.09
Questions
1.06
naire
0.98
question
0.96
asked
0.96
politely
0.93
Activations Density 1.324%