INDEX
Explanations
mentions of politician Dianne Feinstein
references to the politician Dianne Feinstein
New Auto-Interp
Negative Logits
ocation
-0.73
ized
-0.71
alis
-0.70
ocative
-0.70
oday
-0.69
paddle
-0.69
izations
-0.67
eers
-0.66
urgical
-0.66
urgy
-0.66
POSITIVE LOGITS
Feinstein
1.13
Dianne
0.85
Fe
0.84
éĥ
0.78
æł
0.75
Whe
0.75
BILITY
0.74
Nunes
0.73
recy
0.72
âĸĵ
0.72
Activations Density 0.035%