INDEX
Explanations
words related to exceptions, exclusions, or special permissions
terms and phrases related to exemptions in various contexts
Negative Logits
Token         Logit
ãĥ¥           -0.73
aces          -0.69
plays         -0.68
Roose         -0.67
Hurricanes    -0.64
KR            -0.63
Bras          -0.62
aptic         -0.62
Xiang         -0.62
Zeit          -0.61
Positive Logits
Token         Logit
exempt        1.15
exemptions    1.07
exemption     0.99
exempted      0.99
exempt        0.97
immunity      0.92
Reviewer      0.88
emption       0.83
untarily      0.82
icut          0.78
Activations Density 0.020%