INDEX
Explanations
words related to appropriation and misappropriation
instances and discussions of cultural appropriation
New Auto-Interp
Negative Logits
Bay
-0.69
lock
-0.69
Sau
-0.69
Box
-0.69
Oracle
-0.68
sail
-0.68
Solitaire
-0.68
Sussex
-0.68
dream
-0.68
Holmes
-0.67
POSITIVE LOGITS
appropri
1.45
appropriation
1.14
ropri
1.11
appropri
1.10
appropriated
1.04
appropriately
1.00
itures
0.93
appropriate
0.91
orrow
0.88
anity
0.86
Activations Density 0.019%