INDEX
Explanations
words related to political and environmental activism, particularly involving organizations and legal battles
New Auto-Interp
Negative Logits
egu
-0.82
xual
-0.77
ilers
-0.72
actionDate
-0.70
vous
-0.70
iler
-0.69
iling
-0.69
ebook
-0.68
xy
-0.67
err
-0.67
POSITIVE LOGITS
still
0.96
upright
0.87
ov
0.81
desks
0.70
atop
0.70
tall
0.68
stationary
0.64
halls
0.64
room
0.63
quo
0.63
Activations Density 0.031%