INDEX
Negative Logits
gimm
-0.07
sneakers
-0.07
silly
-0.07
dangers
-0.07
charity
-0.07
Playground
-0.07
Eggs
-0.07
.samples
-0.07
symp
-0.06
scraped
-0.06
POSITIVE LOGITS
Resolution
0.09
resolutions
0.09
Resolve
0.08
Resolution
0.08
resolve
0.08
resolve
0.08
resolution
0.08
resolution
0.08
dissolve
0.07
resolving
0.07
Activations Density 0.020%