INDEX
Explanations
references to legal, justice, and societal issues
topics related to legal and regulatory issues
New Auto-Interp
Negative Logits
Siber
-0.66
Disable
-0.64
Written
-0.62
iven
-0.62
ãĤ»
-0.60
ãĥķãĤ©
-0.59
ãĥĭ
-0.59
Dim
-0.58
Ox
-0.57
Fem
-0.57
POSITIVE LOGITS
deserve
0.88
tended
0.86
testified
0.78
tend
0.78
are
0.77
hesitate
0.77
rejoice
0.76
contend
0.76
allege
0.75
knows
0.74
Activations Density 1.048%