INDEX
Explanations
text related to political statements and actions regarding social issues
New Auto-Interp
Negative Logits
nakalista
-0.84
ьаж
-0.80
AccessorTable
-0.79
LookAnd
-0.77
-0.70
Dulles
-0.70
surla
-0.69
ReusableCell
-0.68
Pyrr
-0.68
المكان
-0.67
POSITIVE LOGITS
neighborhood
1.04
Neighborhood
0.87
neighbourhood
0.82
Neighborhood
0.80
neighborhoods
0.77
neighborhood
0.73
vecind
0.73
NEIGH
0.69
barrio
0.68
Neighbourhood
0.68
Activations Density 0.193%