INDEX
Explanations
references to legal or authoritative figures, particularly those with titles like "editor" or "solicitor"
words or phrases related to legal or professional roles
New Auto-Interp
Negative Logits
©
-0.75
©¶æ
-0.70
sett
-0.66
¥ŀ
-0.65
param
-0.64
GoldMagikarp
-0.64
sterdam
-0.62
CAST
-0.61
·
-0.61
understanding
-0.60
POSITIVE LOGITS
ious
1.21
itor
1.20
ial
1.04
itors
1.03
berus
1.03
ium
1.00
iously
0.99
IAL
0.92
ionage
0.88
ially
0.84
Activations Density 0.013%