INDEX
Explanations
phrases related to making information or actions publicly available
phrases related to public disclosure or statements made publicly
New Auto-Interp
Negative Logits
ective
-0.74
gypt
-0.71
sweeps
-0.69
endurance
-0.68
aos
-0.68
acements
-0.66
Maintenance
-0.65
order
-0.63
opal
-0.63
elta
-0.63
POSITIVE LOGITS
publicly
3.66
privately
2.33
openly
1.95
public
1.58
public
1.57
formally
1.38
PUBLIC
1.37
officially
1.35
anonymously
1.30
verbally
1.28
Activations Density 0.010%