INDEX
Explanations
words related to exploring deeply or investigating thoroughly
phrases related to exploration or deep engagement with topics
New Auto-Interp
Negative Logits
runners
-0.72
evidence
-0.71
Islamic
-0.70
1920
-0.68
urers
-0.67
orate
-0.66
Gary
-0.65
say
-0.64
istically
-0.61
Pakistan
-0.59
POSITIVE LOGITS
dive
1.12
Dive
0.99
diving
0.93
dives
0.91
hitter
0.85
delve
0.81
earthqu
0.80
xit
0.78
EStream
0.76
hower
0.76
Activations Density 0.015%