INDEX
Explanations
websites and email addresses
mentions of geographical locations or specific places
New Auto-Interp
Negative Logits
Ninth
-0.81
Brill
-0.79
Tenth
-0.75
Qiao
-0.75
Span
-0.74
Britann
-0.73
Eighth
-0.73
Contract
-0.71
Cotton
-0.71
©¶æ¥µ
-0.71
POSITIVE LOGITS
icago
0.98
ifest
0.98
imore
0.97
music
0.97
dq
0.95
isf
0.95
fd
0.93
@
0.93
iverpool
0.92
atl
0.92
Activations Density 0.148%