INDEX
Explanations
names of specific entities or organizations
proper nouns and significant entities in the text
Negative Logits
ÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤ  -0.76
stereotypical  -0.73
typical        -0.68
1986           -0.66
};             -0.66
Mean           -0.65
notations      -0.64
Typical        -0.64
1989           -0.63
Shutterstock   -0.63
Positive Logits
intends    1.14
continues  1.12
announces  1.11
finally    1.11
has        1.08
awaits     1.07
joins      0.98
appears    0.97
welcomes   0.97
will       0.96
Activations Density 0.486%