INDEX
Explanations
proper nouns related to politics, economics, and classified information
references to specific individuals or names connected to significant events or honors
Negative Logits
Token     Logit
RD        -0.83
greg      -0.83
ral       -0.83
RG        -0.82
ull       -0.79
roth      -0.79
raltar    -0.79
RAD       -0.78
RO        -0.78
ril       -0.78
Positive Logits
Token     Logit
Casey     0.76
USE       0.74
etts      0.73
ante      0.73
anted     0.72
ants      0.70
Pam       0.69
iment     0.68
Sue       0.67
00        0.67
Activations Density: 0.397%