INDEX
Explanations
phrases related to legal and governmental actions
references to legal entities and government-related activities
New Auto-Interp
Negative Logits
lehem
-0.80
MON
-0.79
ubes
-0.78
ãĤ©
-0.78
LX
-0.75
RW
-0.75
Porn
-0.75
onday
-0.74
Kidd
-0.74
Monkey
-0.73
POSITIVE LOGITS
Ag
2.33
Ag
2.19
AG
1.80
ag
1.53
Agents
1.38
Agent
1.36
AG
1.35
Agency
1.33
agent
1.22
Agent
1.20
Activations Density 0.205%