INDEX
Explanations
mentions of specific official titles or positions, particularly those related to law or governance
references to government or organizational titles and positions
New Auto-Interp
Negative Logits
brace
-0.77
track
-0.74
flick
-0.72
invite
-0.69
tracks
-0.68
drib
-0.66
threaded
-0.64
ads
-0.63
tracking
-0.63
envy
-0.63
POSITIVE LOGITS
General
3.93
general
2.82
General
2.04
GENERAL
2.03
general
1.55
Major
1.23
GEN
1.22
Detailed
1.21
Generic
1.20
Minor
1.19
Activations Density 0.015%