Explanations
terms related to controversial or sensitive topics
phrases that indicate significant or contentious political issues
Negative Logits
Edit      -0.81
assisted  -0.75
refunds   -0.72
edit      -0.72
EDIT      -0.71
Edited    -0.69
edits     -0.69
Ern       -0.68
ingham    -0.66
estyle    -0.65
Positive Logits
fixture      1.23
staple       1.23
thorn        1.19
cornerstone  1.13
hotly        1.07
contentious  1.06
topic        1.06
hallmark     1.04
boon         1.03
centerpiece  1.00
Activations Density 0.135%
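
The density figure above is presumably the fraction of sampled tokens on which this feature has a non-zero activation. The page does not state how it was computed, so the following is only a minimal sketch under that assumption. The function name `activation_density`, the `threshold` parameter, and the sample data are hypothetical and chosen purely for illustration.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: "density" means the share of tokens where the feature
    fires at all. The source page does not confirm this definition.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 27 active tokens out of 20,000 (not real data).
sample = [0.0] * 19973 + [1.5] * 27
density = activation_density(sample)
print(f"{density:.3%}")
```

A density this low would mean the feature is sparse: it fires on roughly one token in several hundred, which fits a feature tied to a narrow set of contexts.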