INDEX
Explanations
prompts related to subscribing or not subscribing to newsletters
terms related to privacy and notice policies
New Auto-Interp
Negative Logits
uana
-0.65
coord
-0.64
patriarch
-0.56
pires
-0.56
simul
-0.54
scapego
-0.53
mun
-0.53
sugg
-0.53
Js
-0.52
retri
-0.52
POSITIVE LOGITS
Privacy
0.70
0.68
Submit
0.65
Newsletter
0.64
disclaimer
0.60
ļéĨĴ
0.56
0.56
phrase
0.55
Divinity
0.55
antha
0.54
Activations Density 0.013%