Explanations
words related to physical locations or movements
mentions of physical locations or specific entities
Negative Logits
Token                 Logit
MSN                   -0.69
lez                   -0.65
itri                  -0.63
iversal               -0.61
etheless              -0.60
iannopoulos           -0.59
=-=-=-=-              -0.57
ãĤµãĥ¼ãĥĨãĤ£ãĥ¯ãĥ³    -0.56
GST                   -0.55
ailability            -0.55
Positive Logits
Token                 Logit
Rockefeller           0.59
Gemini                0.51
Dickinson             0.51
Plum                  0.50
Shap                  0.49
ows                   0.49
rook                  0.49
ivist                 0.48
prestige              0.48
Bone                  0.47
Activations Density 1.771%