INDEX
Explanations
terms related to legal and regulatory concepts, such as entrapment
words and phrases related to various forms of "ent" (such as entertainment, engagement, and arrangement)
New Auto-Interp
Negative Logits
Spears
-0.74
士
-0.73
Jenner
-0.72
Responsibility
-0.67
å§«
-0.66
strap
-0.65
BILITIES
-0.63
enthal
-0.63
BILITY
-0.61
Accountability
-0.60
POSITIVE LOGITS
renched
1.06
ourage
1.03
inence
0.99
rust
0.98
ailed
0.98
itled
0.93
rench
0.93
ropy
0.91
rained
0.90
uri
0.90
Activations Density 0.014%