INDEX
Explanations
phrases referring to specific objects or concepts that are important or significant to the person or group mentioned in the text
phrases emphasizing singular importance
New Auto-Interp
Negative Logits
ilings
-0.72
ashington
-0.69
utics
-0.68
ategor
-0.67
dinand
-0.66
ciples
-0.65
ornings
-0.65
zhen
-0.64
odus
-0.64
otypes
-0.64
POSITIVE LOGITS
conceivable
0.93
imaginable
0.82
elig
0.81
able
0.81
available
0.79
redeem
0.79
INO
0.78
ever
0.77
feasible
0.76
authorized
0.75
Activations Density 0.121%