INDEX
Explanations
Twitter handles
references to social media interactions and discussions
New Auto-Interp
Negative Logits
Springfield
-0.81
icent
-0.79
SQ
-0.77
taboola
-0.77
rog
-0.75
Gö
-0.71
Ur
-0.70
owl
-0.70
LR
-0.70
Dispatch
-0.69
POSITIVE LOGITS
Ben
2.44
Ben
2.34
BEN
1.96
ben
1.95
Benjamin
1.83
ben
1.82
Benn
1.23
Bern
1.05
Bennett
1.05
Beau
1.01
Activations Density 0.189%