INDEX
Explanations
references to individuals and their roles in specific contexts
Appears before a name
male names and professions
New Auto-Interp
Negative Logits
Minaj
-0.51
Stalin
-0.50
findpost
-0.49
Kremlin
-0.47
Soviets
-0.46
Stalin
-0.45
europa
-0.44
THISDAY
-0.43
CONSIN
-0.43
Palin
-0.43
POSITIVE LOGITS
David
0.99
John
0.97
Mark
0.94
Michael
0.92
Brian
0.91
Mike
0.91
Mark
0.88
Steve
0.88
Paul
0.87
Kevin
0.86
Activations Density 2.247%