INDEX
Explanations
phrases related to solidarity or unity
terms related to solidarity and community engagement
New Auto-Interp
Negative Logits
STON
-0.71
esters
-0.70
GER
-0.67
usc
-0.62
ger
-0.62
lass
-0.61
ram
-0.61
erick
-0.60
lasses
-0.60
sen
-0.59
POSITIVE LOGITS
arity
1.81
parency
0.88
isy
0.84
ilial
0.84
yip
0.80
ality
0.80
Transparency
0.76
ilitary
0.74
inity
0.74
ibilities
0.74
Activations Density 0.007%