INDEX
Explanations
comma-separated elements in a list
instances of personal pronouns and punctuation indicating speech or dialogue
New Auto-Interp
Negative Logits
elusive
-0.72
viability
-0.68
attractiveness
-0.65
adoption
-0.64
Landing
-0.63
lif
-0.63
kidnapping
-0.62
rendition
-0.62
ORN
-0.62
abduction
-0.61
POSITIVE LOGITS
essed
0.84
aca
0.74
felt
0.72
lived
0.70
pires
0.70
stood
0.69
ided
0.69
might
0.69
uld
0.69
should
0.68
Activations Density 0.206%