INDEX
Explanations
questions or requests for information or assistance
instances of the word "ask" and its variations
New Auto-Interp
Negative Logits
Ĥ¬
-0.79
Tigers
-0.76
Nanto
-0.66
ccording
-0.65
audi
-0.65
swing
-0.64
cutting
-0.64
Revision
-0.62
pite
-0.61
accompan
-0.61
POSITIVE LOGITS
rhet
1.10
naires
0.98
probing
0.96
questions
0.93
erville
0.84
asked
0.83
wered
0.82
politely
0.80
asking
0.78
FontSize
0.78
Activations Density 0.040%