INDEX
Explanations
mentions or instances of formal requests or demands
instances of the word "request" in various contexts
New Auto-Interp
Negative Logits
lasses
-0.77
nat
-0.70
aming
-0.68
kered
-0.62
Surv
-0.61
bart
-0.60
Sul
-0.60
Tycoon
-0.59
Haram
-0.59
cas
-0.59
POSITIVE LOGITS
requests
0.96
requesting
0.93
permission
0.91
Animation
0.89
request
0.89
irection
0.86
request
0.81
submitted
0.81
ioned
0.79
requested
0.78
Activations Density 0.045%