INDEX
Explanations
phrases related to authoritative statements or reports
pronouns referring to individuals, particularly in a possessive context
Negative Logits
INCLUD        -0.72
otin          -0.70
import        -0.68
=\"           -0.66
Æ             -0.66
(?,           -0.64
Researchers   -0.63
Iranians      -0.62
vantage       -0.61
hn            -0.61
Positive Logits
own            1.23
autobiography  1.14
keynote        1.08
introductory   1.05
memoir         1.04
remarks        0.96
maiden         0.92
commentary     0.90
Own            0.90
speech         0.90
Activations Density: 0.103%