INDEX
Explanations
references to a specific political figure, potentially a senator or a chief minister
occurrences of the term "data."
New Auto-Interp
Negative Logits
tails
-0.74
paren
-0.67
liest
-0.66
enegger
-0.65
ONSORED
-0.64
iday
-0.62
crow
-0.62
Petraeus
-0.61
lessly
-0.60
glers
-0.60
POSITIVE LOGITS
ña
1.00
ñ
0.93
eus
0.92
hedral
0.92
hea
0.88
iba
0.87
illon
0.86
fter
0.85
ño
0.84
isy
0.83
Activations Density 0.025%