INDEX
Explanations
words containing the substring "reg"
references to regulations or regulatory concepts
New Auto-Interp
Negative Logits
overhead
-0.64
bondage
-0.64
Owl
-0.64
Meow
-0.64
firsthand
-0.63
Wicked
-0.63
ambush
-0.62
Onion
-0.61
peril
-0.61
Danger
-0.61
POSITIVE LOGITS
reg
4.28
REG
2.10
Reg
1.93
regation
1.75
REG
1.65
Reg
1.46
regulated
1.37
region
1.35
register
1.34
regulation
1.33
Activations Density 0.009%