INDEX
Explanations
places and organizations mentioned in a global political context
conjunctions and connecting phrases in sentences
New Auto-Interp
Negative Logits
tains
-0.77
laughs
-0.77
Gets
-0.75
Kills
-0.72
itates
-0.68
izes
-0.67
Begins
-0.67
pires
-0.66
Written
-0.66
isible
-0.65
POSITIVE LOGITS
are
1.40
were
1.25
aren
1.24
deserve
1.17
have
1.13
specialize
1.10
weren
1.10
constitute
1.09
comprise
1.08
require
1.08
Activations Density 0.478%