INDEX
Explanations
phrases indicating alignment or support towards a particular side or position in a debate or conflict
references to different sides in arguments or disputes, particularly in a political context
Negative Logits
Token      Logit
lear       -0.86
frey       -0.82
ardy       -0.78
rss        -0.76
enegger    -0.73
htaking    -0.73
encers     -0.71
yssey      -0.71
untu       -0.70
imens      -0.70
Positive Logits
Token      Logit
kick       0.92
seams      0.83
seam       0.77
Za         0.76
side       0.76
sidelines  0.73
boards     0.70
side       0.70
board      0.70
altar      0.68
Activations Density: 0.021%