Explanations
instances of the verb "make" in various forms and contexts
Negative Logits
iture         -0.17
ragaz         -0.17
itech         -0.16
umbo          -0.16
vic           -0.16
lc            -0.15
ANDOM         -0.15
icable        -0.15
onor          -0.15
есть          -0.15
Positive Logits
fun            0.28
faces          0.24
fun            0.21
light          0.21
eye            0.20
Faces          0.18
conversation   0.18
Fun            0.17
believe        0.17
.fun           0.17
Activations Density 0.066%