INDEX
Explanations
instances of the word 'trick'
mentions of "trick" and related terms in various contexts
New Auto-Interp
Negative Logits
olars
-0.74
ankind
-0.72
Hemp
-0.71
ccording
-0.70
Supplement
-0.70
isot
-0.67
Bok
-0.64
Domain
-0.63
vation
-0.63
undai
-0.61
POSITIVE LOGITS
ery
1.22
tricks
1.04
ster
1.00
trick
0.99
eries
0.91
Trick
0.90
sters
0.87
door
0.87
iest
0.84
aroo
0.83
Activations Density 0.020%