INDEX
Explanations
phrases or sentences indicating legal allegations
terms related to legal claims or statements of assertion
New Auto-Interp
Negative Logits
perm
-0.79
ammy
-0.76
aa
-0.71
talk
-0.68
perty
-0.67
ots
-0.67
oppy
-0.67
onen
-0.66
ggies
-0.64
ajo
-0.62
POSITIVE LOGITS
alleges
1.26
allege
1.03
alleging
1.00
allegations
0.98
accuses
0.95
accusing
0.88
denies
0.85
accuse
0.83
accusations
0.81
accuser
0.80
Activations Density 0.009%