INDEX
Explanations
verbs indicating actions or behaviors that someone is supposed to do or that involve making decisions
phrases indicating possession or obligation
New Auto-Interp
Negative Logits
nai
-0.73
Introduced
-0.70
Rated
-0.69
advertisement
-0.68
redes
-0.67
iership
-0.66
displayText
-0.65
predec
-0.64
Flavoring
-0.63
conclud
-0.63
POSITIVE LOGITS
him
1.55
them
1.49
us
1.17
THEM
1.14
HIM
1.05
me
1.02
him
1.00
someone
0.99
them
0.94
everyone
0.92
Activations Density 0.308%