INDEX
Explanations
political affiliations
key terms related to political affiliations and class distinctions
Negative Logits
Travels       -0.67
mun           -0.66
contagious    -0.62
Donation      -0.61
WARN          -0.61
Sno           -0.61
0010          -0.59
Riley         -0.59
disclaimer    -0.57
RNA           -0.57
Positive Logits
ones          1.02
itto          0.81
anooga        0.75
versa         0.75
arge          0.74
bsite         0.73
SPONSORED     0.71
ngth          0.70
Ones          0.69
counterpart   0.69
Activations Density 0.517%