Explanations
words related to opposite sides or viewpoints
references to conflicting parties or perspectives in a discussion
Negative Logits
Token       Logit
cles        -0.69
Countdown   -0.63
Miko        -0.60
agna        -0.59
dehyd       -0.59
RON         -0.58
Berry       -0.58
Polaris     -0.58
Assist      -0.58
nce         -0.57
Positive Logits
Token       Logit
side        0.86
alike       0.83
sides       0.81
thereof     0.74
pace        0.72
cale        0.72
combatants  0.70
concede     0.69
equally     0.69
hust        0.69
Activations Density 0.018%