INDEX
Explanations
statements or declarations made by entities or individuals
occurrences of the word "said" and related statements in a news context
Negative Logits
estern      -0.75
pes         -0.72
å¦          -0.72
cffffcc     -0.71
Friend      -0.69
odox        -0.67
ldom        -0.66
tnc         -0.66
zin         -0.66
龍契士      -0.66
Positive Logits
its         1.18
itself      1.00
Its         0.87
they        0.83
yesterday   0.82
ITS         0.82
Its         0.81
goodbye     0.78
Wednesday   0.76
Thursday    0.75
Activations Density 0.131%
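
The page does not say how the density figure is computed. A common reading is that it is the fraction of token positions on which the feature has a non-zero activation. The sketch below works under that assumption only; the function name `activation_density`, the `threshold` parameter, and the sample data are hypothetical and are not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of token positions on which
    the feature fires at all. The dashboard's exact definition is not stated.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 100,000 token positions, 131 of them active.
sample = [0.0] * 99_869 + [0.5] * 131
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.131%
```

Under this reading, a density of 0.131% means the feature fires on roughly 1 in 760 token positions.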