INDEX
Explanations
numerical data and statistics related to social issues
Negative Logits
Token      Logit
Loose      -0.17
iche       -0.15
695        -0.15
Elev       -0.15
Stanton    -0.15
loose      -0.15
ause       -0.14
ret        -0.14
ÏĥÏį       -0.14
Wilde      -0.13
Positive Logits
Token      Logit
fcc        0.15
رÙĬÙģ     0.15
illet      0.15
rava       0.15
aby        0.15
atter      0.14
bia        0.14
fav        0.14
ented      0.14
CCI        0.14
Activations Density 0.118%