INDEX
Explanations
references to personal information, particularly related to births and deaths
Negative Logits
Token        Logit
Seattle      -0.18
Hanson       -0.18
Minneapolis  -0.18
Rochester    -0.17
Seattle      -0.17
Vancouver    -0.16
Minnesota    -0.16
Portland     -0.16
ieran        -0.16
orca         -0.16
Positive Logits
Token        Logit
Alabama      0.26
Georgia      0.25
Yaz          0.24
Alabama      0.23
Tennessee    0.23
Memphis      0.23
Georgia      0.22
Arkansas     0.22
Knoxville    0.21
abama        0.21
Activations Density: 0.385%