INDEX
Explanations
verbs related to sharing or revealing information
terms related to revealing or withholding information and their consequences
Negative Logits
click    -0.83
Round    -0.81
raid     -0.75
mite     -0.73
Catch    -0.72
ulate    -0.71
ENE      -0.71
flush    -0.70
urse     -0.70
inate    -0.69
Positive Logits
relinqu   1.62
divul     1.61
dissemin  1.60
ascert    1.43
assimil   1.43
disav     1.41
disreg    1.40
solic     1.40
obliter   1.39
util      1.38
Activations Density 0.118%
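
The density figure above can be read as the fraction of token positions on which the feature fires. The sketch below assumes that reading; the page itself does not state how the value is computed. The function name `activation_density`, the zero threshold, and the sample data are all hypothetical and are not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of token positions where the
    feature is active. The source page does not confirm this definition.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 10,000 token positions, 12 of which are active.
sample = [0.0] * 9988 + [0.7] * 12
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.120%
```

Under this reading, a density of 0.118% corresponds to the feature being active on roughly 1.18 of every 1,000 token positions.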