INDEX
Explanations
references to a specific political figure named Cruz
mentions of the name "Cruz."
New Auto-Interp
Negative Logits
Seym
-0.74
à¨
-0.72
unseen
-0.70
ebin
-0.68
ochemical
-0.66
OX
-0.65
Finnish
-0.65
Bulgarian
-0.62
Pebble
-0.62
Ö¼
-0.61
POSITIVE LOGITS
Cruz
1.10
Cruz
1.10
omics
0.87
Rubio
0.80
yne
0.78
itus
0.77
ettes
0.76
anne
0.76
Care
0.74
Rafael
0.73
Activations Density 0.011%