Explanations
prompts related to contacting or reaching out for information or assistance
Negative Logits
everything    -0.19
everything    -0.17
portion       -0.15
Everything    -0.15
alis          -0.14
imb           -0.14
strap         -0.14
uku           -0.14
alte          -0.14
#End          -0.13
Positive Logits
(token missing)    0.24
someone            0.22
headquarters       0.21
management         0.20
Customer           0.20
management         0.19
someone            0.19
whoever            0.18
somebody           0.18
him                0.17
Activations Density 0.209%