INDEX
Explanations
words related to being exempt or exceptions to rules or regulations
terms related to legal exemptions and exclusions
New Auto-Interp
Negative Logits
ãĥ¥
-0.72
Xiang
-0.67
Roses
-0.66
aces
-0.65
plays
-0.65
Stro
-0.64
Gest
-0.62
Roose
-0.62
kr
-0.61
KR
-0.61
POSITIVE LOGITS
exempt
1.19
exempt
1.10
exemptions
1.08
exempted
1.05
exemption
1.00
Reviewer
0.90
immunity
0.89
emption
0.86
untarily
0.84
icut
0.81
Activations Density 0.020%