INDEX
Explanations
statements or written communications issued by individuals
references to official communications, such as statements, reports, and letters
New Auto-Interp
Negative Logits
Authors
-0.62
comes
-0.59
acists
-0.57
ances
-0.57
encounters
-0.57
ayed
-0.56
icides
-0.55
glers
-0.54
breeds
-0.54
chants
-0.53
POSITIVE LOGITS
titled
1.11
stating
1.11
outlining
1.09
indicating
1.04
saying
1.04
detailing
1.03
thanking
1.02
alleging
1.01
entitled
0.98
apologizing
0.98
Activations Density 0.219%