INDEX
Explanations
government and organization domains
New Auto-Interp
Negative Logits
Mixin
0.39
💅
0.36
/?
0.35
Recommended
0.35
airtight
0.35
Plus
0.35
Clicked
0.34
OÜ
0.34
PlayerSelector
0.33
Spare
0.33
POSITIVE LOGITS
gov
0.67
gov
0.67
govt
0.55
cornell
0.53
gouv
0.52
harvard
0.50
Govt
0.48
europa
0.47
govt
0.45
uc
0.44
Activations Density 0.011%