INDEX
Explanations
personal references pointing to the speaker
references to the speaker or self
New Auto-Interp
Negative Logits
Equality
-0.63
profits
-0.60
edged
-0.60
Confederation
-0.58
Governments
-0.58
earable
-0.57
ipedia
-0.57
Coalition
-0.57
Args
-0.56
Us
-0.55
POSITIVE LOGITS
lees
1.17
zzo
1.11
personally
1.09
adows
1.08
adow
1.03
andering
0.97
imei
0.95
cca
0.89
myself
0.88
gal
0.86
Activations Density 0.096%