INDEX
Explanations
portions of text addressing or referring to the reader
references to the audience or reader
New Auto-Interp
Negative Logits
ipal
-0.90
ĸļ
-0.77
entimes
-0.74
Canaver
-0.72
icy
-0.69
ice
-0.67
unic
-0.66
arc
-0.64
ayne
-0.64
emon
-0.63
POSITIVE LOGITS
guys
1.36
yourselves
1.26
tub
1.26
gentlemen
1.07
RS
0.94
're
0.88
Tube
0.88
sir
0.86
yourself
0.85
ladies
0.81
Activations Density 0.162%