INDEX
Explanations
phrases related to political figures and events
instances of possessive pronouns or indicators of ownership
New Auto-Interp
Negative Logits
OLOGY
-0.69
Capture
-0.64
EEE
-0.61
ship
-0.59
rolling
-0.59
LESS
-0.56
ATTLE
-0.55
graft
-0.55
citation
-0.55
ASED
-0.55
POSITIVE LOGITS
kaya
1.40
omething
1.33
ilver
1.29
poon
1.29
cript
1.28
aurus
1.23
ki
1.23
ullivan
1.22
hip
1.21
hire
1.21
Activations Density 0.209%