INDEX
Explanations
terms related to clever or skillful actions
occurrences of the word "trick" and its variants
New Auto-Interp
Negative Logits
ccording
-0.73
ankind
-0.71
isot
-0.69
vation
-0.68
Domain
-0.67
olars
-0.67
Supplement
-0.64
Expend
-0.64
enrichment
-0.64
ndum
-0.64
POSITIVE LOGITS
ery
1.21
ster
1.06
tricks
1.02
eries
0.96
trick
0.96
aroo
0.94
sters
0.93
Trick
0.88
iest
0.86
door
0.86
Activations Density 0.023%