INDEX
Explanations
mentions of the word "the" in relation to content about laws, governance, and social issues
New Auto-Interp
Negative Logits
enden
-0.15
offending
-0.15
rado
-0.14
ære
-0.14
arel
-0.14
ternet
-0.14
rys
-0.14
_DISPATCH
-0.14
ENTITY
-0.13
edback
-0.13
POSITIVE LOGITS
result
0.39
product
0.36
product
0.30
result
0.28
Result
0.25
subject
0.24
stuff
0.24
resultado
0.24
consequence
0.24
Product
0.24
Activations Density 0.142%