INDEX
Explanations
proper names, especially names like "David"
mentions of individuals named David
New Auto-Interp
Negative Logits
cffffcc
-0.86
nces
-0.79
llor
-0.68
uyomi
-0.63
sites
-0.63
Reply
-0.62
ĻĤ
-0.62
SPONSORED
-0.59
ĺħ
-0.59
Flavoring
-0.59
POSITIVE LOGITS
Beckham
1.00
Ortiz
0.87
Silva
0.76
Bowie
0.74
ulla
0.67
Villa
0.67
Hernandez
0.65
guyen
0.64
angelo
0.64
anton
0.63
Activations Density 0.030%