INDEX
Explanations
mentions of a specific type of individual
references to the genre of rap music and its related elements
Negative Logits
ertodd            -0.77
Samar             -0.72
Lumpur            -0.72
Columb            -0.69
++++++++++++++++  -0.68
fman              -0.67
Lebanese          -0.67
Lauder            -0.65
BIT               -0.63
Templar           -0.62
Positive Logits
acious            1.13
acity             1.00
itone             0.93
hett              0.91
heet              0.88
oline             0.87
aic               0.85
athon             0.84
mund              0.84
quet              0.83
Activations Density: 0.012%