INDEX
Explanations
mentions of a specific person (Marco Rubio)
mentions of the name "Marco Rubio"
New Auto-Interp
Negative Logits
ritic
-0.87
sworth
-0.81
wich
-0.75
shire
-0.73
folk
-0.73
nikov
-0.72
suit
-0.72
ritis
-0.69
sterdam
-0.67
yer
-0.67
POSITIVE LOGITS
Polo
1.45
Rubio
1.34
Antonio
0.93
Gutierrez
0.88
Marco
0.85
xtap
0.82
Muss
0.79
Fal
0.79
imo
0.78
Cruz
0.74
Activations Density 0.035%