INDEX
Explanations
names of politicians and their respective states
references to U.S. senators and their associated states or political affiliations
Negative Logits
pim             -0.75
llah            -0.74
DragonMagazine  -0.71
glim            -0.69
Wenger          -0.68
emanc           -0.68
cms             -0.68
runway          -0.66
Reviewer        -0.66
IDF             -0.65
Positive Logits
Kentucky        1.06
Florida         1.00
Ohio            1.00
Idaho           0.98
Minnesota       0.98
Tennessee       0.97
Arizona         0.97
Utah            0.96
Pennsylvania    0.95
Texas           0.95
Activations Density: 0.081%