INDEX
Explanations
occurrences of the word "it" and variations or references around it in various contexts
New Auto-Interp
Negative Logits
ufs
-0.14
Záp
-0.14
ãĥĥãĥī
-0.14
axon
-0.14
andal
-0.14
Barg
-0.13
жÑĥ
-0.13
parer
-0.13
å»
-0.13
alive
-0.13
POSITIVE LOGITS
hard
0.47
difficult
0.46
harder
0.44
tough
0.43
diff
0.40
challenging
0.39
hardest
0.36
hard
0.36
tougher
0.36
diffic
0.36
Activations Density 0.117%