INDEX
Explanations
phrases related to controversy or conflict
phrases indicating moral dilemmas or ethical conflicts
Negative Logits
ahu          -0.74
utral        -0.67
Mechdragon   -0.66
iple         -0.63
USS          -0.61
Js           -0.58
é¾           -0.58
Wire         -0.57
¬¼           -0.57
corrid       -0.57
Positive Logits
namely        1.22
albeit        1.08
despite       0.94
viz           0.93
nor           0.92
especially    0.91
irrespective  0.91
regardless    0.90
lest          0.90
especially    0.88
Activations Density 0.386%