INDEX
Explanations
email addresses
mentions of social media handles or accounts
New Auto-Interp
Negative Logits
Faust
-0.72
Klaus
-0.68
Lans
-0.67
Robotics
-0.67
Chair
-0.64
Lazarus
-0.63
Poe
-0.63
Leonardo
-0.63
Tanks
-0.62
Flav
-0.62
POSITIVE LOGITS
gmail
1.11
daily
1.08
yahoo
1.00
news
0.96
national
0.96
morning
0.95
bleacher
0.94
medi
0.92
north
0.92
lat
0.91
Activations Density 0.025%