INDEX
Explanations
mentions of specific organizations or individuals
occurrences of the preposition "of."
New Auto-Interp
Negative Logits
respective
-0.72
collateral
-0.68
required
-0.68
placeholder
-0.66
accordingly
-0.66
peripher
-0.65
entit
-0.64
dehuman
-0.63
categor
-0.63
deval
-0.63
POSITIVE LOGITS
Alexandria
0.91
Georgetown
0.90
Bellev
0.88
icial
0.85
Providence
0.83
Anaheim
0.82
Syracuse
0.81
sky
0.81
Bethlehem
0.79
Wilmington
0.79
Activations Density 0.044%