INDEX
Explanations
mentions of a specific political figure named "Rand Paul"
mentions of the individual Rand Paul
New Auto-Interp
Negative Logits
dism
-0.70
uated
-0.64
urally
-0.62
clay
-0.62
ysis
-0.60
ded
-0.59
sleepy
-0.59
marrow
-0.58
circadian
-0.58
drag
-0.58
POSITIVE LOGITS
olph
1.39
uin
1.18
wick
1.10
stad
1.04
olf
1.04
igan
0.93
alls
0.87
rand
0.86
emonium
0.85
yth
0.84
Activations Density 0.031%