INDEX
Explanations
phrases related to being at the center of something
references to central locations or pivotal concepts
Negative Logits
Token       Logit
ãĥīãĥ©      -0.80
apons       -0.78
IELD        -0.72
ugg         -0.67
Shotgun     -0.67
phies       -0.66
apon        -0.65
rentices    -0.64
areth       -0.64
IGHTS       -0.63
Positive Logits
Token       Logit
pieces      1.05
stadt       0.89
most        0.86
burst       0.81
piece       0.79
point       0.79
tenance     0.77
eous        0.75
fold        0.75
uve         0.75
Activations Density: 0.015%
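
To give a sense of scale for the density figure, here is a minimal sketch of how such a number could be computed. It assumes that "density" means the fraction of token positions where the feature's activation is above zero; the page does not state its exact definition, and the function name `activation_density` and the sample data are hypothetical.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: density = (number of active positions) / (total positions).
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: the feature fires on 3 of 20,000 token positions.
acts = [0.0] * 19_997 + [1.2, 0.4, 3.1]
density = activation_density(acts)
print(f"{density:.3%}")  # prints 0.015%
```

Under that assumption, a density of 0.015% corresponds to roughly 15 active positions per 100,000 tokens, i.e. a very sparse feature.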