INDEX
Explanations
strong requests or pleas
words related to begging or requests for something
Negative Logits
UV           -0.71
Deploy       -0.70
neau         -0.69
deploying    -0.67
Incident     -0.67
uci          -0.65
Illum        -0.63
Tinker       -0.63
policies     -0.62
Topic        -0.62
Positive Logits
beg          3.52
begging      3.03
begged       2.51
begs         2.37
Beg          1.70
Beg          1.51
begg         1.49
pleading     1.17
beck         1.15
pleas        1.09
Activations Density: 0.020%