INDEX
Explanations
countries and political figures mentioned in a document
names and references to individuals in political contexts
Negative Logits
ensibly    -0.79
pires      -0.71
encia      -0.66
Sphere     -0.66
ttle       -0.65
mire       -0.64
soever     -0.62
until      -0.62
pure       -0.61
mediate    -0.61
Positive Logits
proverb            0.99
remark             0.94
saying             0.93
precedent          0.89
noting             0.89
similarities       0.88
example            0.86
accomplishments    0.86
preced             0.86
stating            0.85
Activations Density 0.434%