INDEX
Explanations
references to a specific individual, "Donald J. Trump."
mentions of Donald Trump
New Auto-Interp
Negative Logits
ulhu
-0.72
ife
-0.66
Worker
-0.61
oph
-0.58
é¾įå¥ij士
-0.57
staff
-0.57
reditary
-0.57
Tycoon
-0.55
Bengal
-0.54
ances
-0.54
POSITIVE LOGITS
redits
0.67
thening
0.58
ermanent
0.54
////
0.53
notice
0.52
nai
0.52
JV
0.51
dot
0.51
tacit
0.50
--------
0.48
Activations Density 0.168%