INDEX
Explanations
proper nouns and dates in news-like articles
punctuation marks and indicators of breaking news or updates
New Auto-Interp
Negative Logits
endeavour
-0.84
lling
-0.75
dra
-0.72
glide
-0.71
teleport
-0.71
abandon
-0.70
apprentices
-0.69
equip
-0.69
Aval
-0.68
borrow
-0.67
POSITIVE LOGITS
SPONSORED
1.36
Anonymous
1.21
According
1.08
Examples
1.07
Interestingly
1.07
However
1.07
Anyway
1.05
Apparently
1.05
Also
1.04
Furthermore
1.03
Activations Density 0.966%