INDEX
Explanations
words related to proper nouns or names
specific names and references to individuals and characters
New Auto-Interp
Negative Logits
Mehran
-0.90
Lancet
-0.69
actionDate
-0.64
Seym
-0.63
remote
-0.62
zens
-0.59
Olympus
-0.59
permanent
-0.59
drivers
-0.59
binding
-0.58
POSITIVE LOGITS
laughs
1.34
laughed
1.23
disagrees
1.22
replies
1.21
replied
1.21
responds
1.20
chuckled
1.17
asks
1.15
smiles
1.12
agrees
1.12
Activations Density 0.438%