INDEX
Explanations
phrases discussing challenging, problematic, or contentious issues
terms related to contentious or thorny issues in discussions
New Auto-Interp
Negative Logits
uthor
-0.82
ocard
-0.72
Excellence
-0.72
ucle
-0.71
oreal
-0.71
opath
-0.71
shelves
-0.70
elsen
-0.69
ugen
-0.68
stocked
-0.67
POSITIVE LOGITS
thorn
1.01
dispute
0.80
questions
0.78
ifact
0.75
disputes
0.75
debated
0.75
tar
0.74
20439
0.74
vex
0.73
debate
0.70
Activations Density 0.048%